AI Training Data Specialist
Conducted RLHF (Reinforcement Learning from Human Feedback) for training large language models. Tested and ranked the AI-suggested responses on accuracy, usefulness, and safety. Rated model performance on various domains such as coding, creative writing, and technical writing. Rated the quality of responses and provided feedback for model development. Scored above 95% on quality ratings for 10,000+ tasks on accuracy.