AI Product Evaluation Specialist
As an AI Product Evaluation Specialist, I designed and executed structured evaluation frameworks for assessing LLM outputs. I evaluated more than 100 model responses per week for reasoning quality, instruction adherence, and hallucination risk, and implemented feedback loops to improve model performance through RLHF cycles and prompt optimization.
• Developed structured evaluation rubrics for LLM output quality.
• Conducted quality audits and compliance validation against specifications.
• Identified recurring failure patterns to inform optimization strategies.
• Enhanced RLHF training cycles with actionable feedback for production models.