AI Linguistic Evaluation - Self-Initiated Project
I participated in the evaluation and quality assurance of Arabic-language outputs from Large Language Models (LLMs). Leveraging my expertise in Modern Standard Arabic, I assessed AI-generated text for factual accuracy, grammar, and naturalness. My primary responsibilities included documenting errors and providing constructive feedback to enhance linguistic accuracy. • Identified and documented hallucinations, unnatural phrasing, and grammatical errors • Refined prompts to improve model performance in Arabic • Conducted regular reviews to ensure cultural and stylistic appropriateness • Compiled structured reports used for iterative LLM improvements