Bilingual AI Data Localization
In this role, I evaluated and validated structured and unstructured outputs from large language models in English and Portuguese. My responsibilities included assessing the correctness, alignment, and linguistic reliability of multimodal outputs, including JSON-based responses. I contributed to data annotation, consistency checking, and dataset curation within side-by-side comparison and QA workflows. • Conducted evaluation of text, image, audio, and video outputs for hallucinations and instruction-following reliability. • Performed criteria-based validation and surfaced quality regressions and edge cases. • Improved annotation consistency and reported workflow/tooling limitations. • Enhanced evaluation reliability and contributed to deployment readiness.