For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
V
Vaibhav

Vaibhav

Technical AI Trainer & RLHF Specialist

India flagN/A, India
$45.00/hrExpertDon T Disclose

Key Skills

Software

Don't disclose

Top Subject Matter

LLM Safety
Coding (Java/C++)
Chain-of-Thought Auditing

Top Data Types

TextText
DocumentDocument

Top Task Types

Red TeamingRed Teaming
Fine-tuningFine-tuning

Freelancer Overview

Technical AI Trainer & RLHF Specialist. Brings 3+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Technology, Rayat Bahra University (2027). AI-training focus includes data types such as Text, Computer Code, and Programming and labeling workflows including Red Teaming and Fine-tuning.

ExpertEnglish

Labeling Experience

Technical AI Trainer & RLHF Specialist

TextRed Teaming
As a Technical AI Trainer and RLHF Specialist, I performed adversarial testing on advanced large language models to identify security vulnerabilities and logical errors. My work included conducting Supervised Fine-Tuning (SFT) with expert-level code and problem solutions and auditing AI-generated code for hallucinations and incorrect logic. I created high-level Chain-of-Thought (CoT) responses for mathematical, medical, and legal AI training workflows. • Exposed vulnerabilities in reasoning and logic in LLMs via red teaming and adversarial prompt creation. • Drafted reference solutions in Java and C++ for complex data structures and algorithm tasks. • Identified non-idiomatic or faulty code patterns in model outputs to improve generation accuracy. • Generated CoT reasoning data to guide AI on solving multi-step, high-complexity queries.

As a Technical AI Trainer and RLHF Specialist, I performed adversarial testing on advanced large language models to identify security vulnerabilities and logical errors. My work included conducting Supervised Fine-Tuning (SFT) with expert-level code and problem solutions and auditing AI-generated code for hallucinations and incorrect logic. I created high-level Chain-of-Thought (CoT) responses for mathematical, medical, and legal AI training workflows. • Exposed vulnerabilities in reasoning and logic in LLMs via red teaming and adversarial prompt creation. • Drafted reference solutions in Java and C++ for complex data structures and algorithm tasks. • Identified non-idiomatic or faulty code patterns in model outputs to improve generation accuracy. • Generated CoT reasoning data to guide AI on solving multi-step, high-complexity queries.

2024 - Present

Technical Project Lead

Fine Tuning
As a Technical Project Lead, I engineered and debugged intricate data structure implementations for use as ground truth in model training. My responsibilities also included refining AI-generated technical manuals to a professional English standard and ensuring clarity for model comprehension. I worked independently to provide polished, accurate training data for advanced AI systems. • Created high-fidelity data structure and algorithm code references for AI model fine-tuning. • Upgraded technical documentation to a C1-level for global engineering audiences. • Ensured ground truth code validity and logical precision for LLM development. • Delivered continuous code-based data for AI learning cycles throughout project duration.

As a Technical Project Lead, I engineered and debugged intricate data structure implementations for use as ground truth in model training. My responsibilities also included refining AI-generated technical manuals to a professional English standard and ensuring clarity for model comprehension. I worked independently to provide polished, accurate training data for advanced AI systems. • Created high-fidelity data structure and algorithm code references for AI model fine-tuning. • Upgraded technical documentation to a C1-level for global engineering audiences. • Ensured ground truth code validity and logical precision for LLM development. • Delivered continuous code-based data for AI learning cycles throughout project duration.

2023 - 2025

Education

R

Rayat Bahra University

Bachelor of Technology, Computer Science and Engineering

Bachelor of Technology
2023 - 2027

Work History

I

Independent

Technical Project Lead

N/A
2023 - 2025