Business Analyst – Conversational AI Evaluation (Turing)
This role required comprehensive side-by-side evaluations of conversational AI model outputs for quality and accuracy. RLHF evaluations were conducted to enhance model performance via systematic response comparison. Rigorous fact-checking and structured justifications were integral to iterative model improvement.
• Evaluated AI model responses for alignment with user intent and relevance.
• Applied established guidelines to provide detailed quality assessments.
• Utilized browser-based analytical tools and command-line workflows.
• Maintained high standards for consistency and benchmark adherence.