For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
A

Ankit Kumar

Generative AI Generalist (RLHF/Data Labeling)

India flagKhagaria, India
$12.00/hrEntry LevelScale AI

Key Skills

Software

Scale AIScale AI

Top Subject Matter

Large Language Models
AI Safety
Rlhf Domain Expertise

Top Data Types

TextText
ImageImage
VideoVideo

Top Task Types

RLHF
Data Collection
Classification
Object Detection
Segmentation

Freelancer Overview

Generative AI Generalist (RLHF/Data Labeling). Brings 2+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. AI-training focus includes data types such as Text and labeling workflows including RLHF.

Entry LevelEnglishHindi

Labeling Experience

Derendering Websites.

VideoData Collection
Recording interaction video for the requested site, triggering several interactions across pages. Deconstruction of different animations and UI components. Detailed description of design and direction of website.

Recording interaction video for the requested site, triggering several interactions across pages. Deconstruction of different animations and UI components. Detailed description of design and direction of website.

2026 - Present

Generative AI Generalist (RLHF/Data Labeling)

TextRLHF
Executed complex RLHF tasks to fine-tune large language models focusing on reasoning, logic, and safety within frontier AI systems. Conducted detailed model evaluations by benchmarking outputs against strict quality rubrics to identify hallucinations and logical fallacies. Performed multi-turn prompt engineering to test boundaries and deliver high-density data for RL environment training. • Delivered high-quality labeled data in a fast-paced, evolving workflow. • Benchmarked AI outputs against rigorous criteria to ensure quality and accuracy. • Created and refined data for RLHF and model fine-tuning objectives. • Collaborated with teams to adapt labeling processes to dynamic project demands.

Executed complex RLHF tasks to fine-tune large language models focusing on reasoning, logic, and safety within frontier AI systems. Conducted detailed model evaluations by benchmarking outputs against strict quality rubrics to identify hallucinations and logical fallacies. Performed multi-turn prompt engineering to test boundaries and deliver high-density data for RL environment training. • Delivered high-quality labeled data in a fast-paced, evolving workflow. • Benchmarked AI outputs against rigorous criteria to ensure quality and accuracy. • Created and refined data for RLHF and model fine-tuning objectives. • Collaborated with teams to adapt labeling processes to dynamic project demands.

2026 - Present

Education

S

Saint Xavier's College Ranchi

Senior Secondary, Sciences

Senior Secondary
2017 - 2019

Work History

I

Independent

Web Product Designer & Developer

Ranchi
2024 - 2025