Freelance Data Annotator & AI Training Specialist
I carried out RLHF (Reinforcement Learning from Human Feedback) and AI response evaluation tasks such as rating and ranking AI-generated outputs. This work involved judging outputs for factual accuracy, tone, coherence, and safety in alignment with client and task guidelines. My efforts directly contributed to fine-tuning and improving large language models. • Rated and ranked AI model outputs for quality and alignment • Compared and evaluated various AI responses in real-world scenarios • Ensured data labeling met high accuracy and consistency metrics • Provided actionable feedback for AI model improvement