Data Labeling - Project
Labelled brand logos on un stable platforms like player jersey, helmet, and surfaces from live video footage for fine tuning vision models
Hire this AI Trainer
Sign in or create an account to invite AI Trainers to your job.
No subject matter listed
I have over six years of experience in data science and AI, with a strong focus on building and optimizing machine learning and NLP solutions. My background includes designing end-to-end data pipelines, developing and deploying models for natural language processing, computer vision, and data analytics, and ensuring high-quality training data through robust preprocessing, validation, and annotation strategies. I am proficient in Python, PyTorch, TensorFlow, and cloud platforms like AWS and Azure, and have hands-on experience with tools such as NLTK, SpaCy, Hugging Face, and OpenCV. My projects have ranged from reviewer recommendation systems using BERT and NER to multi-agent AI frameworks for medical appointment booking, where data integrity and accurate labeling were critical. I am passionate about leveraging my expertise to deliver reliable, scalable AI solutions powered by high-quality, well-annotated data.
Labelled brand logos on un stable platforms like player jersey, helmet, and surfaces from live video footage for fine tuning vision models
As a Sr. Associate Technology - AI/ML/Data Science at Synechron, I curated diverse datasets for financial NLP tasks and defined strict annotation guidelines to minimize labeler ambiguity. I designed data ingestion workflows for multi-agent systems, ensuring high-quality ground truth for financial decision models. Automated data validation checks were implemented within Airflow pipelines to detect drift and schema anomalies before labeling. • Led definition of annotation schemas for NLP classification of financial data • Implemented workflow automation to ensure label quality and consistency • Minimized ambiguity via detailed labeling policies and continuous feedback • Oversaw end-to-end curation, validation, and preparation of training datasets
As a Data Scientist at 3K Technologies Pvt. Ltd., I oversaw the annotation of large-scale corpora for entity extraction and summarization in specialized domains. I developed semi-automated labeling pipelines using Large Language Models (LLMs) to pre-label text, reducing manual annotation time substantially. I collaborated with subject matter experts to refine the taxonomy and ontology for medical and business datasets. • Directed team annotation projects for high-volume NER and classification tasks • Built and optimized pre-labeling pipelines leveraging LLM technology • Addressed class imbalance with synthetic data generation strategies • Ensured data quality through expert review and iterative label refinement
At Trendzlink Technologies Pvt. Ltd., I managed end-to-end image annotation workflows for object detection projects using Roboflow. Using advanced augmentation techniques, I expanded training datasets and improved model generalization. My work on quality assurance and correction of bounding box alignments and segmentation masks led to substantial model accuracy gains. • Coordinated large-scale image labeling campaigns for object detection • Applied augmentation (rotation, occlusion, noise) for data enrichment • Performed hands-on QA, correcting annotation errors and inconsistencies • Directly contributed to model improvement through high-quality data labeling
Bachelor of Technology, Information Technology
Senior Associate Technology, AI/ML/Data Science
Data Scientist