AI Training & RLHF Specialist
Served as a specialist for multiple AI model training projects focused on providing human feedback to refine model outputs. Delivered constructive evaluations to enhance the performance of large language models and guided improvements through reinforcement learning from human feedback (RLHF). Collaborated with teams to ensure data quality and maximize model learning efficiency. • Provided consistent and high-quality ratings for AI-generated text outputs. • Utilized RLHF and EVAL frameworks for content evaluation and model training. • Supported AI training by flagging errors, ambiguities, or undesired model behaviors. • Participated in iterative feedback cycles to optimize AI model responses.