AI Training & Evaluation Contributor (Remote Contract)
As an AI Training & Evaluation Contributor, I evaluated AI-generated responses for factual accuracy, consistency, and safety in a remote, contract-based role. My work involved annotating datasets with structured data points and applying standardized scoring rubrics. I contributed to identifying unsafe medical advice and maintained high inter-rater reliability across all evaluation tasks.
• Evaluated over 2,000 AI-generated responses for accuracy and safety
• Annotated more than 50,000 structured data points to enhance model performance
• Maintained 97-99% inter-rater reliability in all evaluation activities
• Documented all findings in compliance with audit-ready reporting protocols