Cypher Evals IT
Assess, classify, and rate LLM responses to prompts that I did not write. Focus on instruction following, localization, truthfulness, and conciseness.
I have months of hands-on experience in data labeling and AI training, primarily in language-based assessment. Through my work with Scale AI, I now have extensive expertise in writing detailed prompts for large language models (LLMs) and evaluating their outputs for instruction following, truthfulness, writing quality, and localization. My work focuses on categorizing responses in these areas to ensure the model adheres to the specified guidelines. While most of my experience is in language-based tasks, I have also assessed outputs in math and coding. This background, together with my detail-oriented approach, allows me to contribute effectively to the labeling process and ensure high-quality training data for AI models.
Write math exercises as prompts to the LLM, then assess, classify, and rate its responses based on truthfulness, instruction following, localization, and conciseness.
Write elaborate, localized prompts (creative writing, open QA, closed QA with a reference text, information extraction from a reference text, summarization, etc.) and assess the LLM's outputs based on instruction following, localization, writing quality, truthfulness, conciseness, and harmfulness.
Two-year Master's degree, Political Science, International Relations, European Studies
Contributor
Researcher