AI Model Evaluation & Arabic NLP Data Labeling
Worked on evaluating AI-generated responses in Arabic, assessing linguistic accuracy, contextual understanding, and coherence. Designed and refined prompts to enhance AI performance in Arabic NLP tasks, ensuring better alignment with user expectations. Provided structured feedback and fine-tuning recommendations to improve model behavior. Focused on maintaining high-quality annotations and following strict evaluation guidelines.