Senior Quality Ratter
Contributed to the development and refinement of state-of-the-art Large Language Models (LLMs) through Reinforcement Learning from Human Feedback (RLHF). Collaborated with AI research teams to improve model accuracy, safety, and alignment by generating high-quality training data and evaluating model outputs against strict performance criteria.