AI Response Evaluator & Linguistic Reviewer
As an AI Response Evaluator & Linguistic Reviewer at OneForma (Centific), I evaluated AI-generated Arabic responses using structured quality and safety rubrics. My work focused on ranking, quality review, and improving the naturalness, empathy, and cultural accuracy of conversational model outputs. I ensured labeling accuracy and maintained workflow consistency while adhering to strict evaluation protocols.
• Compared and ranked multiple LLM outputs for clarity, empathy, and contextual relevance.
• Identified unsafe suggestions, tone inconsistencies, and minimization language in conversational responses.
• Refined AI outputs to enhance naturalness and ensure Saudi cultural appropriateness.
• Completed multiple certification tracks, including LE, PR, and QA workflows.