Multi-Modal AI Data Annotation for Object Detection & Audio Classification
Project Description Worked on a large-scale video annotation project designed to support the development of advanced computer vision models for object detection and tracking. The project focused on accurately labeling dynamic scenes to improve model performance in real-world environments. My responsibilities included frame-by-frame video annotation using bounding boxes and polygon segmentation, tracking multiple objects across sequences, and labeling over 15 object categories such as vehicles, pedestrians, and environmental elements. I also applied keypoint annotations for motion and pose estimation tasks where required. The project involved processing thousands of video clips and annotating over 100,000 frames. I followed strict annotation guidelines to ensure consistency and precision across sequences. Quality assurance measures included double-review checks, IoU (Intersection over Union) validation, temporal consistency verification for tracking tasks, and continuous feedback imp