AI Data Specialist (Outlier)
As an AI Data Specialist at Outlier, I applied Reinforcement Learning from Human Feedback to enhance model learning and response quality. I methodically evaluated AI outputs for safety, truthfulness, harmlessness, and policy compliance. I performed prompt engineering and quality assurance on analyst work, focusing on prompt and instruction-following tasks. • Conducted detailed review of model prompts and responses • Assessed conversational and instructional data for alignment • Evaluated and marked outputs for policy compliance • Contributed feedback to improve model safety and performance