AI Model Evaluation Specialist (Meta Project via RWS)
As an AI Model Evaluation Specialist on the Meta Project via RWS, I was responsible for assessing AI-generated outputs for their quality and accuracy. My work included detailed analysis, benchmarking model performance, and providing structured feedback to guide model improvement. The position required strict adherence to quality guidelines and precise documentation of results. • Evaluated AI model outputs across multiple domains for technical accuracy and logical reasoning. • Compared alternative responses to identify inconsistencies, hallucinations, and edge-case issues. • Tagged and categorized content by topic, difficulty, and correctness for benchmarking. • Delivered structured feedback using evaluation rubrics to improve AI model performance.