Multimodal AI Specialist – Video & Visual Data Classification
This project focused on the classification of video and visual data to support multimodal large language model (LLM) development. My role involved reviewing video content and annotating it based on predefined classification criteria such as scene type, action categories, sentiment, and relevance to associated textual prompts. Using a proprietary labeling platform, I handled a high volume of data while maintaining strict quality control through regular audits, inter-rater consistency checks, and continuous guideline updates. The labeled data contributed to training and aligning LLMs to better understand and respond to visual inputs.