AI Training Data Specialist / Annotator
As an AI Training Data Specialist/Annotator at Outlier AI, I specialized in Reinforcement Learning from Human Feedback (RLHF) to improve model accuracy and safety. My work involved prompt engineering, response evaluation, and hallucination detection for large language models. I performed comprehensive data labeling tasks, audio transcription, and multimodal annotation. • Wrote and refined high-quality prompts to guide AI responses. • Evaluated and rated AI-generated outputs for correctness and logic. • Identified and flagged hallucinations and errors in responses. • Performed audio-to-text transcription for multimodal training sets.