Vansh Mahajan - Software Engineer (AI Data Labeling & RLHF)

Key Skills

Software

Internal/Proprietary Tooling

Top Subject Matter

AI model training and evaluation

Top Data Types

Text

Computer Code Programming

Top Task Types

Prompt + Response Writing (SFT)

Freelancer Overview

Software Engineer (AI Data Labeling & RLHF). Brings 1+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Technology, Guru Nanak Dev University (2025). AI-training focus includes data types such as Text and labeling workflows including Prompt + Response Writing (SFT).

IntermediateEnglish

Labeling Experience

Software Engineer (AI Data Labeling & RLHF)

TextPrompt Response Writing SFT

Led supervised fine-tuning by developing and validating high-quality, task-specific prompt and response datasets to improve model accuracy. Collaborated with trainers, built review/approval pipelines, and designed workflow history tracking for greater labeling reliability. Executed RLHF workflows in partnership with annotators, refining reward models to align AI output with user expectations. • Built analytics dashboards and LaTeX rendering for improved monitoring. • Engineered CI/CD pipelines for training and evaluating AI models. • Utilized Docker for reproducible environments. • Implemented secure role-based authentication with Firebase Google OAuth.

2025 - Present

Education

G

Guru Nanak Dev University

Bachelor of Technology, Computer Science and Engineering

Bachelor of Technology

2021 - 2025

Work History

T

Turing

Software engineer

San Fransico

2025 - Present

G

Genesis Techno Soft

Full Stack Developer Intern

N/A

2025 - 2025