AI prompt response evaluation
The CherryOpal AI Prompt Response Evaluation project focused on refining AI-generated responses for an e-commerce LLM. Tasks included entity recognition (NER) and classification, evaluating response accuracy, coherence, and relevance, and rewriting prompts for supervised fine-tuning (SFT) to enhance AI interactions. Using Prodigy, I labeled thousands of text responses, ensuring high consistency. Adhering to strict quality guidelines, I maintained precision, contextual accuracy, and industry standards to optimize AI-driven shopping experiences. The project aimed to improve search relevance and response quality, ensuring AI-generated interactions met user expectations with high linguistic and contextual integrity across large-scale datasets.