Ankit Kumar - Generative AI Generalist (RLHF/Data Labeling)

Key Skills

Software

Scale AI

Top Subject Matter

Large Language Models

AI Safety

RLHF Domain Expertise

Top Data Types

Text

Image

Video

Top Task Types

RLHF

Data Collection

Classification

Object Detection

Segmentation

Freelancer Overview

Generative AI Generalist (RLHF/Data Labeling) with 2+ years of professional experience across complex workflows, research, and quality-focused execution. Core strengths include internal and proprietary tooling. AI-training focus covers text data and RLHF labeling workflows.

Entry Level

English

Hindi

Labeling Experience

Derendering Websites

Video

Data Collection

Recorded interaction videos of the requested site, triggering several interactions across pages. Deconstructed the site's animations and UI components. Wrote detailed descriptions of the website's design and direction.

2026 - Present

Generative AI Generalist (RLHF/Data Labeling)

Text

RLHF

Executed complex RLHF tasks to fine-tune large language models, focusing on reasoning, logic, and safety within frontier AI systems. Conducted detailed model evaluations, benchmarking outputs against strict quality rubrics to identify hallucinations and logical fallacies. Performed multi-turn prompt engineering to test model boundaries and deliver high-density data for RL environment training.

• Delivered high-quality labeled data in a fast-paced, evolving workflow.
• Benchmarked AI outputs against rigorous criteria to ensure quality and accuracy.
• Created and refined data for RLHF and model fine-tuning objectives.
• Collaborated with teams to adapt labeling processes to dynamic project demands.

2026 - Present

Education

Saint Xavier's College Ranchi

Senior Secondary, Sciences

2017 - 2019

Work History

Independent

Web Product Designer & Developer

Ranchi

2024 - 2025