AI Model Development & Evaluation Specialist
As an AI Model Development & Evaluation Specialist at Outlier AI, I evaluate and refine large language model (LLM) outputs, assessing accuracy, reasoning quality, and task alignment. I also design and optimize prompts to improve response quality and consistency across task types.
• Evaluate text-based LLM outputs for accuracy and relevance.
• Provide structured improvement feedback to model developers.
• Design effective prompts to test and improve LLM behavior.
• Work within global teams to maintain high-quality evaluation standards.