Multimodal Data Annotation for Computer Vision and NLP Tasks using Label Studio
Worked on multiple data annotation and labeling projects involving image, text, and audio datasets using Label Studio, CVAT, and LabelImg. Key responsibilities included: Annotating images with bounding boxes, polygons, and segmentation masks for object detection and image segmentation models. Performing Named Entity Recognition (NER) and text classification for NLP and LLM training datasets. Labeling audio data with transcription and speech segmentation for ASR model training. Ensuring high-quality annotations by following strict labeling guidelines and performing quality assurance checks. Handling large-scale datasets (5,000+ samples) while maintaining >98% annotation accuracy. The project focused on building high-quality training datasets for machine learning, deep learning, and large language models (LLMs).