Generalist Expert
As a Generalist Expert, I evaluated AI model outputs across multiple tasks for reasoning, clarity, and accuracy. The work involved assessing pairs of responses and rating them to support AI system improvement. I structured each judgment around project-specific criteria to keep ratings consistent.
• Compared and rated AI-generated text responses
• Assessed instruction adherence and response quality
• Documented feedback for model improvement
• Used project-specific internal tooling to complete reviews