Senior Data Content Specialist (AI Training & RLHF)
Regularly contributed to training and optimizing AI language models through the application of Reinforcement Learning from Human Feedback (RLHF). Evaluated and rated AI-generated text outputs to ensure they met strict logical, grammatical, and safety criteria. Developed and refined gold standard rubrics used for internal consistency and accuracy in training data labeling tasks. • Assessed nuanced, multi-turn dialogues for instruction-following, factuality, and hallucination detection. • Coordinated with cross-functional teams to implement automated tooling that enhanced data labeling precision. • Maintained exacting standards in reviewing, classifying, and providing feedback on model outputs. • Enabled ongoing model improvement through critical content review and performance analytics.