AI Data Labeler / LLM Evaluator
In this role, I evaluated and ranked large language model (LLM) outputs according to established guidelines. I performed data annotation of text, including categorization, safety assessment, and relevance tagging. My responsibilities also included detailed feedback provision to enhance training data quality. • Conducted logical accuracy checks and policy compliance reviews • Labeled text for instruction adherence and fact-checking • Identified inconsistencies and instruction-following failures • Utilized structured evaluation rubrics for consistent task performance.