Remote AI Content Specialist & Data Annotator
• Performed RLHF (Reinforcement Learning from Human Feedback) evaluations and data annotation on large language models with a focus on quality and safety. Carried out side-by-side model output comparisons, preference ranking, and long-form response evaluations across diverse content domains.
• Authored high-quality golden responses and prompts to serve as exemplary training data standards.
• Designed and applied custom rubrics for grading and quality control.
• Executed adversarial testing (red teaming) to identify model vulnerabilities.
• Conducted structured data analysis and rigorous fact-checking of AI-generated outputs.
• Adapted to evolving project requirements, including foundational code review and advanced linguistic tasks.