Arabic LLM evaluator
I specialize in evaluating large language model (LLM) outputs, with a focus on code generation accuracy and Arabic-language response quality. My work involves analyzing and validating model-generated code for correctness, efficiency, and real-world usability. I'm also experienced in crafting effective prompts and assessing Arabic responses for fluency, coherence, and cultural relevance. With a strong bilingual background and hands-on coding skills, I bridge the gap between technical precision and linguistic nuance in LLM evaluation workflows.