AI Training & Evaluation Generalist (Contractor) at Outlier.ai
Conducted structured evaluations of AI responses. Performed comparative judgment tasks and executed audit workflows to identify reasoning or factual issues in model outputs. Completed analysis tasks across general-knowledge, technical, and real-world domains while maintaining quality.
• Evaluated and scored AI model outputs for accuracy and relevance.
• Ranked responses by preference and provided a rationale for each judgment.
• Audited model outputs for reasoning and policy-alignment issues.
• Evaluated diverse subject matter according to guidelines.