
Ashish Kumar Singh

AI Trainer (Freelance) | Outlier AI

Jharkhand, India
$20.00/hr | Intermediate | Other

Key Skills

Software

Other

Top Subject Matter

LLM Training
Coding Domain Expertise
Reasoning Domain Expertise

Top Data Types

Text
Computer Code Programming
Document

Top Task Types

Classification
Computer Programming Coding
Data Collection
Object Detection
Text Generation
Text Summarization
RLHF
Question Answering
Fine Tuning
Evaluation Rating
Prompt Response Writing SFT
Transcription

Freelancer Overview

AI Trainer (Freelance) at Outlier AI with 3+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths are listed as Other. Education: Bachelor of Technology, IIEST Shibpur (2025). AI-training focus includes Text data and labeling workflows such as RLHF.

Intermediate | Bengali | Hindi | English

Labeling Experience

AI Trainer (Freelance) | Outlier AI

Other | Text | RLHF
As an AI Trainer for Outlier AI, I annotated more than 500 text samples weekly to build and refine large language model datasets. I authored prompts across coding and reasoning tasks and provided RLHF feedback to reduce hallucinations and biases in model responses. I also evaluated model outputs for correctness and adherence to strict labeling guidelines.

• Conducted high-volume weekly data annotation for LLM training.
• Designed and reviewed hundreds of prompts spanning technical and reasoning domains.
• Provided RLHF-based feedback to fine-tune model outputs, decreasing error rates noticeably.
• Maintained dataset quality and labeling integrity through guideline enforcement.


2025 - Present

Education


IIEST Shibpur

Bachelor of Technology, Information Technology

2021 - 2025

Work History


Tata Consultancy Services

Workato Automation Engineer

Kolkata
2025 - Present

MNNIT Allahabad

Research Intern

Allahabad
2024