Cypher Evals
In the Cypher Evals project, I specialized in evaluating pre-generated AI responses based on six key criteria: instruction following, truthfulness, writing quality, verbosity, localization, and harmlessness. Unlike Cypher RHLF, my role in this project focused solely on assessing and providing feedback for existing responses rather than creating new prompts. This required a critical eye for detail and linguistic expertise to ensure that the responses met the highest standards in French, English