Senior AI Content Specialist
As a Senior AI Content Specialist, I led the evaluation of AI-generated large language model responses to ensure truthfulness, helpfulness, and harmlessness. My work focused on performing RLHF by ranking multiple AI outputs, writing complex prompts, and collaborating directly with machine learning engineers to drive improvements. I also played a key role in maintaining a 98% quality score across performance audits.• Conducted side-by-side and model ranking evaluations adhering to strict guidelines. • Crafted and refined challenging prompts to rigorously test LLM performance and bias. • Regularly provided detailed feedback and reasoning to optimize LLM decision processes. • Coordinated quality initiatives and contributed to cross-functional communication.