AI Training Generalist | Outlier AI
As an AI Training Generalist at Outlier AI, I evaluated and annotated AI-generated responses for factual accuracy, safety, and adherence to user prompts. My responsibilities included applying complex project guidelines and providing structured feedback for reinforcement learning from human feedback (RLHF). I consistently verified that AI outputs complied with detailed instructions and met high data quality standards.
• Evaluated a wide variety of AI-generated text responses across diverse subjects
• Assessed outputs for factual integrity, prompt adherence, and safety concerns
• Used proprietary/internal tooling for annotation and feedback
• Contributed to the ongoing refinement of models through expert evaluation