AI & Data Quality Specialist (Independent)
Evaluated and ranked thousands of AI-generated responses in Mexican Spanish across multiple LLM platforms. Tasks included RLHF preference ranking, factual accuracy verification, cultural adaptation review, and red-teaming exercises. Developed annotation rubrics for Spanish-language NLP tasks including sentiment analysis, entity recognition, and text classification. Focused on identifying edge cases, biases, and failure modes specific to Latin American Spanish. Delivered consistent 95%+ inter-annotator agreement scores on quality metrics.