AI Training Data Annotation & Prompt Engineering for Multilingual LLMs
I am currently contributing to a high-volume NLP project supporting the development of multilingual large language models (LLMs) for global clients. To date, I have annotated over 5,000 linguistic samples monthly in Mexican Spanish, consistently maintaining a 98.7% precision rate. I translate and generate culturally adapted prompt sets leveraging regional expressions and sociolinguistic context, resulting in a 22% measured improvement in model accuracy for Mexican Spanish users. I also designed standardized annotation protocols that reduced team inconsistencies by 35%, and collaborate directly with ML engineers to refine guidelines, accelerating task completion by 15% while preserving quality. All work follows strict ethical, privacy, and bias-aware standards, including responsible handling of sensitive content. This ongoing role demonstrates my ability to deliver scalable, culturally grounded training data that enhances AI performance in real-world applications.