Data Labeling Fellow & Reviewer
As a Data Labeling Fellow & Reviewer at Handshake AI, I supported complex AI data and evaluation workflows by conducting research and expert review for prompt development and model assessment. I evaluated large language model (LLM) outputs for accuracy, reasoning quality, and depth across text, audio, and visual tasks. I provided structured feedback and escalations on edge cases to improve model understanding and quality.
• Conducted evaluations of text, audio, and visual tasks for LLM outputs
• Provided feedback to refine evaluation standards and annotation guidance
• Partnered with leads to create quality criteria for Fellow workflows
• Drove measurable improvements in AI model output consistency and quality