AI Content Lead & Evaluation Specialist
As AI Content Lead & Evaluation Specialist at Turing, I managed an AI team that reviewed and annotated training data for LLMs. I evaluated model outputs using the 4-axis framework, providing feedback on reasoning, factuality, and safety. I also developed prompt scenarios to test and improve the models' guide-writing and workflow capabilities.
• Oversaw QA of LLM prompts and responses for high-accuracy data deliveries.
• Conducted detailed annotations to improve model alignment and safety.
• Designed workflows that accelerated review cycles by 45%.
• Created complex multi-turn prompt interactions for robust scenario evaluation.