Ashutosh Roy - Computer Vision & Audio Model Specialist | Data Annotation Engineer | Fine tune Model

Key Skills

Software

SuperAnnotate

Label Studio

Google Cloud Vertex AI

Labelbox

Roboflow

Top Subject Matter

Document Processing/Machine Learning

Archaeological Document Processing

Speech Emotion Recognition

Top Data Types

Document

Audio

Image

Top Task Types

Entity Ner Classification

Emotion Recognition

Bounding Box

Segmentation

Classification

Object Detection

Text Generation

Question Answering

Text Summarization

Fine Tuning

Transcription

Computer Programming Coding

Data Collection

Freelancer Overview

Research Intern, IIT Delhi. Brings 2+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include OpenCV and Python. Education includes Bachelor of Technology, CSE - AI(Specialist) . AI-training focus includes data types such as Document, Audio, and Medical and labeling workflows including Entity (NER) Classification and Emotion Recognition.

ExpertHindiEnglish

Labeling Experience

Data Annotator (Speech Emotion Dataset Processing)

AudioEmotion Recognition

In the Speech Emotion Dataset Processing project, I processed audio datasets and extracted important features for machine learning tasks. I carefully organized and validated labeled datasets, ensuring consistency and reliability for emotion recognition systems. My responsibilities included comprehensive data cleaning and verification. • Processed and labeled audio datasets for ML. • Extracted relevant audio features including MFCC, pitch, and energy. • Validated labeled datasets for emotion recognition. • Ensured consistency and quality of annotations.

2025 - 2025

Data Annotator (Archaeological Data Processing System)

DocumentEntity Ner Classification

For the Archaeological Data Processing System project, I converted scanned documents into structured datasets through OCR methods. I ensured accurate annotation of text and formatted data for various analytical and search-oriented use cases. This work involved careful data curation and consistent application of annotation guidelines. • Implemented OCR for structured document conversion. • Cleaned and formatted annotated data. • Annotated textual and metadata elements with precision. • Maintained data quality for analysis purposes.

2025 - 2025

Research Intern, IIT Delhi

DocumentEntity Ner Classification

As a Research Intern at IIT Delhi, I processed and labeled large-scale scanned documents using OCR technologies for machine learning applications. I focused on cleaning, annotating, and validating extracted text data to ensure high-accuracy datasets. Quality evaluation was performed using recognition metrics and structured data organization was maintained throughout the project. • Managed OCR-based extraction and annotation of document data. • Labeled and cleaned textual data for downstream ML tasks. • Conducted validation and quality assurance using metrics such as WER and CER. • Organized and maintained structured datasets for efficient access.

2025 - 2025

Data Annotator (Medical Data Processing – Chatbot)

Entity Ner Classification

For the Medical Data Processing (Chatbot) project, I extracted and structured information from medical reports using OCR technology. I prepared cleaned and annotated datasets, making them suitable for NLP-based systems and analysis. High attention was given to labeling accuracy and dataset preparation protocols. • Utilized OCR to extract medical data from reports. • Structured and cleaned data for NLP training. • Annotated key medical entities and information. • Maintained high standards of labeling precision.

2024 - 2024

Education

C

CSVTU - UTD

Bachelor of Technology CSE, Computer Science, Artificial Intelligence

Bachelor of Technology CSE

2022 - 2026

Work History

I

IIT Delhi

Research Intern

Delhi

2025 - 2025

M

MitoVoid AI

AI/ML Engineer Intern

Gurugram

2024 - 2024