AI Data Labeler / Model Evaluation Analyst
As an AI Data Labeler and Model Evaluation Analyst, I labeled and evaluated AI-generated text for large language model (LLM) training. I applied detailed rubrics to assess model outputs, identified hallucinations and factual inaccuracies, and provided clear written justifications for my ratings. This role required strict adherence to complex annotation guidelines and a high level of attention to detail. • Labeled and reviewed AI-generated text responses for quality and accuracy • Ranked and classified LLM outputs using evaluation frameworks • Identified hallucinations, errors, and provided rationales for ratings • Ensured all annotation followed project-specific guidelines and standards