AI Model Training & Evaluation Specialist - DataAnnotation.tech
On the DataAnnotation.tech platform, I specialized in Reinforcement Learning from Human Feedback (RLHF) for Large Language Models (LLMs) across multiple modalities. My core objective was to improve model performance, safety, and alignment through careful evaluation of model outputs and the generation of high-quality training data.