Omni
I contributed to Project Omni, a data labeling initiative focused on evaluating conversational AI responses to enhance natural language processing (NLP) models. My role involved analyzing user prompts and comparing two AI-generated responses (A and B) using the Labelbox platform. For example, I evaluated responses to a prompt about challenges in creating fusion cuisine, ensuring alignment with the AI persona (a chef specialized in Indian fusion cuisine). I tagged prompts, selected the better response based on relevance and accuracy, and rated both responses for language quality, including grammar, fluency, and consistency, achieving a 95% accuracy rate in my annotations. This project honed my attention to detail and deepened my understanding of conversational AI evaluation, directly supporting the improvement of AI-driven dialogue systems.