Senior AI Engineer – Data Labeling/Evaluation Lead
Oversaw AI evaluation and annotation workflows for chatbot and LLM outputs. Maintained high schema compliance in text, image, and audio projects, diagnosing bias and documenting edge cases for model improvement. Summarized QA defects to prioritize fixes in sentiment, intent, and classification pipelines.
• Scored relevance, tone, and factual consistency in LLM responses
• Validated labels in spreadsheets and SQL extracts
• Executed multilingual adversarial prompt testing
• Improved inter-rater agreement through rubric updates