Advanced AI Trainer
I worked on an AI evaluation and benchmarking project for a confidential client, comparing the performance of two conversational AI systems through voice-based and text interactions. My tasks included generating prompts, conducting parallel system evaluations, assigning numerical ratings, and writing detailed qualitative explanations and comparative analyses based on criteria such as accuracy, reasoning quality, linguistic clarity, responsiveness, and overall user experience. I followed strict evaluation guidelines and quality assurance standards to ensure consistent, objective, and high-quality assessments.