AI Evaluator / RLHF Contributor (Independent Contractor)
As an AI Evaluator and RLHF Contributor, I evaluated AI-generated responses for accuracy, safety, and helpfulness. I systematically identified and documented hallucinations, logical flaws, and reliability issues in large language models. My responsibilities included applying scoring rubrics and providing detailed feedback to engineering teams. • Executed remote evaluations across multiple domains • Used structured guidelines to ensure objective and consistent ratings • Delivered concise justifications for ratings and feedback • Supported model safety and performance improvement efforts.