AI Training & Multi-Modal Data Annotation Specialist
Led multi-modal AI data annotation projects spanning image, video, audio, and text datasets for machine learning and deep learning model training. Produced bounding-box, polygon, and semantic segmentation annotations for object detection models, including YOLO-based systems. Conducted frame-by-frame video annotation and object tracking for surveillance and traffic monitoring datasets. Annotated and transcribed multilingual audio datasets, including labeling for speaker identification and emotion recognition tasks. Labeled text for Named Entity Recognition (NER) and text classification to support NLP model training. Managed datasets of 10,000+ images, 5,000+ video sequences, and 1,000+ hours of audio recordings.