Freelance AI Trainer & Data Labeler (RLHF Evaluator)
I evaluated chatbot responses for clarity, relevance, and compliance during reinforcement learning from human feedback (RLHF) tasks. My feedback cycles directly contributed to improvements in large language model (LLM) performance. I consistently provided structured feedback that enhanced user experience and reduced error rates. • Rated and reviewed LLM-generated responses in chat settings • Checked for compliance with project-specific guidelines • Identified and escalated problematic or unclear chatbot outputs • Helped optimize LLM responses through targeted feedback