AI Trainer & Data Annotator
As an AI Trainer & Data Annotator at Outlier (Scale AI), I evaluated and ranked large language model responses on finance, economics, and general reasoning tasks. I assessed outputs for factual accuracy, logical consistency, instruction adherence, and provided detailed written feedback for RLHF pipelines. The work involved writing reference answers, preference labels, and detecting subtle model errors under volume-based workflows. • Evaluated LLM outputs for accuracy, coherence, and instruction-following • Labeled LLM responses to support RLHF model training • Applied domain expertise to identify nuanced financial and quantitative errors • Produced actionable written rationales and high-quality reference content