AI Evaluation Specialist
Conduct structured LLM response evaluation applying the ICON-REAL framework across seven quality dimensions: Understanding, Completeness, Accuracy, Non-contradiction, Relevance, Conciseness, and Clarity. • Assign severity ratings to annotation issues and produce sourced factual error claims with citation references for model improvement. • Execute RLHF preference ranking tasks across bilingual Arabic–English batches under strict Phase II project specifications. • Maintain consistent quality scores across high-volume evaluation cycles with low revision and rejection rate.