Arabic LLM Prompt Evaluation & Text Labeling Project
I worked on large-scale Arabic data labeling projects for LLM development, covering text quality assessment, intent classification, prompt/response pair evaluation, and named entity recognition (NER). I evaluated generated text for fluency, correctness, and cultural alignment, which helped improve the accuracy of AI systems trained on Arabic data. Daily task batches exceeded 1,000 items and were subject to stringent QA protocols and linguistic standards covering Modern Standard Arabic as well as Gulf and North African dialects. I consistently maintained high annotation accuracy and met fast turnaround times, contributing to model fine-tuning for multilingual NLP.