Mritunjay Pandey


SFT & RLHF Prompt Engineer (Google DeepMind)

Daltonganj, India
$29.00/hr | Intermediate | Remotasks, Scale AI, Google Cloud Vertex AI

Key Skills

Software

Remotasks
Scale AI
Google Cloud Vertex AI
Deep Systems
Data Annotation Tech
AWS SageMaker

Top Subject Matter

AI SFT RLHF Prompt Engineering
Machine learning Data Scientist Python Coding Expertise
STEM Domains

Top Data Types

Computer Code Programming
Text
Image

Top Task Types

RLHF
Prompt + Response Writing (SFT)
Computer Programming/Coding
Evaluation/Rating
Transcription
Fine-tuning
Text Summarization
Question Answering
Text Generation
Object Detection
Entity (NER) Classification
Classification
Data Collection

Freelancer Overview

SFT & RLHF Prompt Engineer (Google DeepMind) with 3+ years of professional experience spanning complex workflows, research, and quality-focused execution. Core strengths include internal and proprietary tooling. Holds a Bachelor of Technology from Dr. B R Ambedkar National Institute of Technology (NIT), Jalandhar. AI-training focus covers computer code and programming data, with labeling workflows including RLHF.

Intermediate | English, Hindi

Labeling Experience

SFT & RLHF Prompt Engineer (Google DeepMind)

RLHF
Created more than 700 gold-standard Python and STEM triads facilitating Chain-of-Thought reasoning for reinforcement learning from human feedback (RLHF) training of advanced AI models. Evaluated and rated over 2,000 multi-turn prompt/response conversations in accordance with established Standard Operating Procedures (SOPs), incorporating Human-in-the-Loop workflows. Maintained exceptionally high quality control accuracy while progressing to a Reviewer role responsible for final dataset validation.
• Authored datasets targeting model alignment and consistency improvements.
• Focused on AI evaluation for robotics and technical domains.
• Employed coding triad review and prompt engineering expertise.
• Directly contributed to AI chatbot and LLM training cycles.


2024 - 2025

Education

Dr. B R Ambedkar National Institute of Technology (NIT), Jalandhar

Bachelor of Technology, Engineering

2020 - 2024

Work History

Aditya Birla Group

Lead Data Scientist

Mumbai
2025 - Present

Namekart Pvt. Ltd.

AI Engineer

Noida
2025 - 2025