For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
V
Vansh Mahajan

Vansh Mahajan

Software Engineer (AI Data Labeling & RLHF)

India flagN/A, India
$25.00/hrIntermediateInternal Proprietary Tooling

Key Skills

Software

Internal/Proprietary Tooling

Top Subject Matter

AI model training and evaluation

Top Data Types

TextText
Computer Code ProgrammingComputer Code Programming

Top Task Types

Prompt + Response Writing (SFT)Prompt + Response Writing (SFT)

Freelancer Overview

Software Engineer (AI Data Labeling & RLHF). Brings 1+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Technology, Guru Nanak Dev University (2025). AI-training focus includes data types such as Text and labeling workflows including Prompt + Response Writing (SFT).

IntermediateEnglish

Labeling Experience

Software Engineer (AI Data Labeling & RLHF)

TextPrompt Response Writing SFT
Led supervised fine-tuning by developing and validating high-quality, task-specific prompt and response datasets to improve model accuracy. Collaborated with trainers, built review/approval pipelines, and designed workflow history tracking for greater labeling reliability. Executed RLHF workflows in partnership with annotators, refining reward models to align AI output with user expectations. • Built analytics dashboards and LaTeX rendering for improved monitoring. • Engineered CI/CD pipelines for training and evaluating AI models. • Utilized Docker for reproducible environments. • Implemented secure role-based authentication with Firebase Google OAuth.

Led supervised fine-tuning by developing and validating high-quality, task-specific prompt and response datasets to improve model accuracy. Collaborated with trainers, built review/approval pipelines, and designed workflow history tracking for greater labeling reliability. Executed RLHF workflows in partnership with annotators, refining reward models to align AI output with user expectations. • Built analytics dashboards and LaTeX rendering for improved monitoring. • Engineered CI/CD pipelines for training and evaluating AI models. • Utilized Docker for reproducible environments. • Implemented secure role-based authentication with Firebase Google OAuth.

2025 - Present

Education

G

Guru Nanak Dev University

Bachelor of Technology, Computer Science and Engineering

Bachelor of Technology
2021 - 2025

Work History

T

Turing

Software engineer

San Fransico
2025 - Present
G

Genesis Techno Soft

Full Stack Developer Intern

N/A
2025 - 2025