Freelance AI Trainer / Data Labeler
As a Freelance AI Trainer and Data Labeler, I performed extensive RLHF (Reinforcement Learning from Human Feedback) tasks to refine machine learning models. This involved evaluating, ranking, and providing detailed feedback on AI-generated conversational outputs across multiple annotation projects. My work maintained high accuracy throughout several quality audits while supporting the training of conversational AI systems. • Evaluated and rated AI-generated responses for accuracy, helpfulness, and safety. • Applied complex annotation guidelines to nuanced intent and sentiment categorization tasks. • Collaborated with quality teams to report edge cases and improve annotation guidelines. • Maintained accuracy rates above 95% during multi-round audits.