Ashutosh Roy

Computer Vision & Audio Model Specialist | Data Annotation Engineer | Fine tune Model

Delhi, India
$22.00/hr | Expert | SuperAnnotate | Label Studio | Google Cloud Vertex AI

Key Skills

Software

SuperAnnotate
Label Studio
Google Cloud Vertex AI
Labelbox
Roboflow

Top Subject Matter

Document Processing/Machine Learning
Archaeological Document Processing
Speech Emotion Recognition

Top Data Types

Document
Audio
Image

Top Task Types

Entity (NER) Classification
Emotion Recognition
Bounding Box
Segmentation
Classification
Object Detection
Text Generation
Question Answering
Text Summarization
Fine Tuning
Transcription
Computer Programming Coding
Data Collection

Freelancer Overview

Research Intern, IIT Delhi. Brings 2+ years of experience across complex professional workflows, research, and quality-focused execution. Core strengths include OpenCV and Python. Education includes a Bachelor of Technology, CSE - AI (Specialist). AI-training focus includes data types such as Document, Audio, and Medical, and labeling workflows including Entity (NER) Classification and Emotion Recognition.

Expert | Hindi | English

Labeling Experience

Data Annotator (Speech Emotion Dataset Processing)

Audio | Emotion Recognition
In the Speech Emotion Dataset Processing project, I processed audio datasets and extracted important features for machine learning tasks. I carefully organized and validated labeled datasets, ensuring consistency and reliability for emotion recognition systems. My responsibilities included comprehensive data cleaning and verification.
• Processed and labeled audio datasets for ML.
• Extracted relevant audio features including MFCC, pitch, and energy.
• Validated labeled datasets for emotion recognition.
• Ensured consistency and quality of annotations.

2025 - 2025
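
For readers unfamiliar with the audio features named in the entry above, here is a minimal sketch of two of them, energy and pitch, computed on a single frame in plain Python. It is an illustration only, not the pipeline used in the project: the function names, the 50-500 Hz search range, and the synthetic test tone are all assumptions. MFCCs are omitted because they are normally computed with a signal-processing library rather than by hand.

```python
import math


def rms_energy(frame):
    """Root-mean-square energy of one frame of audio samples."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def estimate_pitch(frame, sample_rate, fmin=50.0, fmax=500.0):
    """Estimate the fundamental frequency (Hz) of a frame.

    Picks the lag with the largest autocorrelation inside the lag range
    that corresponds to [fmin, fmax]. Returns 0.0 if no positive peak
    is found (for example, on silence).
    """
    min_lag = int(sample_rate / fmax)
    max_lag = min(int(sample_rate / fmin), len(frame) - 1)
    best_lag, best_corr = 0, 0.0
    for lag in range(min_lag, max_lag + 1):
        corr = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        if corr > best_corr:
            best_lag, best_corr = lag, corr
    return sample_rate / best_lag if best_lag else 0.0


# Hypothetical input: 0.1 s of a 200 Hz sine tone sampled at 8 kHz.
SAMPLE_RATE = 8000
tone = [math.sin(2 * math.pi * 200 * n / SAMPLE_RATE) for n in range(800)]

energy = rms_energy(tone)
pitch = estimate_pitch(tone, SAMPLE_RATE)
```

In practice these values would be computed per short frame (a few tens of milliseconds) across a recording and stored alongside the emotion label for each clip.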

Data Annotator (Archaeological Data Processing System)

Document | Entity (NER) Classification
For the Archaeological Data Processing System project, I converted scanned documents into structured datasets through OCR methods. I ensured accurate annotation of text and formatted data for various analytical and search-oriented use cases. This work involved careful data curation and consistent application of annotation guidelines.
• Implemented OCR for structured document conversion.
• Cleaned and formatted annotated data.
• Annotated textual and metadata elements with precision.
• Maintained data quality for analysis purposes.

2025 - 2025

Research Intern, IIT Delhi

Document | Entity (NER) Classification
As a Research Intern at IIT Delhi, I processed and labeled large-scale scanned documents using OCR technologies for machine learning applications. I focused on cleaning, annotating, and validating extracted text data to ensure high-accuracy datasets. Quality evaluation was performed using recognition metrics, and structured data organization was maintained throughout the project.
• Managed OCR-based extraction and annotation of document data.
• Labeled and cleaned textual data for downstream ML tasks.
• Conducted validation and quality assurance using metrics such as WER and CER.
• Organized and maintained structured datasets for efficient access.

2025 - 2025
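
The WER and CER metrics mentioned in the entry above are standard: word error rate and character error rate are the edit distance between a reference transcription and the OCR output, divided by the reference length in words or characters. The sketch below shows one common way to compute them in plain Python. It is not the code used in the internship, and the function names and sample strings are made up for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences.

    Counts the minimum number of substitutions, insertions, and
    deletions needed to turn `ref` into `hyp`.
    """
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance / reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: char-level edit distance / reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Hypothetical ground truth and OCR output for one line of a scanned page.
truth = "the old temple wall"
ocr_output = "the old tempel wall"

line_wer = wer(truth, ocr_output)  # one wrong word out of four
line_cer = cer(truth, ocr_output)  # two wrong characters out of nineteen
```

Both rates can exceed 1.0 when the output contains many insertions, so they are error rates rather than bounded percentages.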

Data Annotator (Medical Data Processing – Chatbot)

Entity (NER) Classification
For the Medical Data Processing (Chatbot) project, I extracted and structured information from medical reports using OCR technology. I prepared cleaned and annotated datasets, making them suitable for NLP-based systems and analysis. High attention was given to labeling accuracy and dataset preparation protocols.
• Utilized OCR to extract medical data from reports.
• Structured and cleaned data for NLP training.
• Annotated key medical entities and information.
• Maintained high standards of labeling precision.

2024 - 2024

Education

CSVTU - UTD

Bachelor of Technology CSE, Computer Science, Artificial Intelligence

2022 - 2026

Work History

IIT Delhi

Research Intern

Delhi
2025 - 2025

MitoVoid AI

AI/ML Engineer Intern

Gurugram
2024 - 2024