Multimodal Image-Text QA Validation & Annotation for Enterprise AI Systems
Engaged in massive multimodal data labeling and validation for enterprise-level AI training pipelines related to image-based question answering and visual understanding. Tasks included validating image-text pairs, confirming the accuracy of extracted text, correcting incorrectly labeled objects and scenes, and validating visual-text consistency. Performed classification, answer rewriting, and quality assessment within tight annotation requirements. Identified ambiguous and unanswerable questions with rationale and used structured assessment frameworks to ensure dataset quality. Monitored more than 30,000 items while meeting production requirements of 7-8 minutes per item and 98% quality adherence.