Cypher Evals
In this project I rated two model generated responses separately across various dimensions. Provided preference ranking scores. Wrote a justification for the response preference ranking and explained the specifics and logic behind my decision.