Multi-Turn AI Chat | Rating Task (Arabic-Moroccan)
In this role, I was responsible for reviewing multi-turn AI chat conversations, carefully reading all exchanges between the user and the AI to evaluate the quality of responses. At the end of each session, I analyzed two AI-generated answers, rated them based on predefined quality metrics such as coherence, accuracy, relevance, and fluency, and selected the response that best fit the context. In addition, I refined low-quality responses when necessary to improve conversational flow and engagement. My contributions helped enhance AI language models, making them more natural, accurate, and effective for real-world interactions