AI Evaluation Expert & Independent Contractor
As an AI Evaluation Expert at Mercor AI, I performed data labeling and annotation of AI-generated outputs across text, audio, and video domains. My work involved evaluating model responses for accuracy, reasoning, and contextual relevance using structured annotation guidelines. I conducted side-by-side comparisons, assigned quality ratings, and identified factual and logical errors to improve model performance. I contributed to annotation workflows across domains including sports analytics, general knowledge, audio analysis, and video captioning. Tools used: Mercor Studio, Airtable, Parimango, Slack, Insightful.