
Aayush Kumar

Delivery Manager - AI Data Operations

Bangalore, India
$40.00/hr, Expert, Google Cloud Vertex AI, Mercor, Other

Key Skills

Software

Google Cloud Vertex AI
Mercor
Other
Internal/Proprietary Tooling

Top Subject Matter

No subject matter listed

Top Data Types

Computer Code Programming
Document
Image
Text
Video

Top Label Types

Evaluation Rating
Fine Tuning
Prompt Response Writing SFT
RLHF

Freelancer Overview

I have hands-on experience in AI data operations, data annotation, and managing large-scale AI training data projects. My background includes leading teams in the creation and quality assurance of RLHF, CUA, and SFT datasets for large language models, ensuring high annotation standards and strict guideline compliance. I have worked directly on structured data labeling, technical prompt annotation, and refining annotation guidelines to improve accuracy and consistency for LLM training and benchmarking. My technical skills span Python, shell scripting, AWS, and cloud-based data workflows, and I have contributed to computer vision projects such as vehicle detection using CNNs. With experience coordinating teams of over 250 contributors and optimizing data pipelines, I am adept at delivering high-quality training data for enterprise AI applications.

Expert, English, Hindi

Labeling Experience

Delivery Manager

Internal Proprietary Tooling, Text, RLHF, Fine Tuning
– Started as a trainer for Codeforces-style competitive programming tasks, evaluating contributor solutions and guiding improvements in algorithmic reasoning and code quality.
– Worked on the OSWorld RLHF dataset and later managed RL data operations for a team of 20 contributors focused on reinforcement learning training data.
– Led development of CUA and SFT datasets used for training large language models while ensuring annotation quality and guideline compliance.
– Promoted to Delivery Manager, managing 250+ contributors and 10 POD Leads while coordinating dataset production for enterprise clients including Alibaba.
– Designed task distribution pipelines, review workflows, and quality control systems to maintain high-quality LLM training data.


2025 - 2025
Mercor

LLM Researcher

Mercor, Document, Evaluation Rating
– Performed structured data annotation and evaluation for datasets used in training and benchmarking large language models.
– Annotated technical prompts, responses, and reasoning chains to improve dataset accuracy and model alignment.
– Worked with reviewers to refine annotation guidelines and maintain consistency across large-scale labeling tasks.


2025 - 2025

Education

UC Berkeley

Masters, Computer Optimization

2025 - 2025
National Institute of Technology, Kurukshetra

Bachelor of Technology, Computer Engineering

2021 - 2025

Work History

Airbus

DevOps Engineer

Bengaluru
2025 - Present