Senior AI Data Specialist — AI Model Evaluation
Evaluated AI-generated responses for accuracy, clarity, and logical consistency as part of a large-scale remote model evaluation team. Compared and ranked multiple model outputs, flagging gaps and hallucinations and providing feedback to improve AI system performance. Performed quality assurance on datasets, correcting annotation inconsistencies in accordance with detailed guidelines.
• Tested model performance across multiple subject areas and modalities
• Provided structured written feedback to guide prompt improvements
• Reviewed responses for factual, logical, and linguistic accuracy
• Worked independently while meeting project and quality targets