AI Data Trainer (RLHF)
As an AI Data Trainer at Turing, I worked on Reinforcement Learning from Human Feedback (RLHF) projects. My primary responsibility was to rate and give feedback on AI-generated outputs used to fine-tune large language models, with the goal of improving how well the models understand and generate human-like text.
• Provided accurate and consistent feedback on a variety of AI-generated text outputs.
• Followed detailed instructions to evaluate and rate outputs for relevance, accuracy, and creativity.
• Collaborated with a remote team to align evaluation standards and reporting.
• Contributed to ongoing AI safety and alignment tasks.