For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
P

Pavan Vanarse

AI Trainer / AI Data Annotator & Evaluator

India flagPune, India
IntermediateOther

Key Skills

Software

Other

Top Subject Matter

Large Language Models
Generative AI
Data Science

Top Data Types

TextText

Top Task Types

RLHF

Freelancer Overview

AI Trainer / AI Data Annotator & Evaluator. Brings 2+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Other. Education includes Bachelor of Commerce, Dr. Babasaheb Ambedkar Marathwada University (2025) and Higher Secondary Certificate, Late Dashrath Baba Madhyamik Vidhyalay Jawkheda (2021). AI-training focus includes data types such as Text and labeling workflows including RLHF.

Intermediate

Labeling Experience

AI Trainer / AI Data Annotator & Evaluator

OtherTextRLHF
As an AI Trainer and Data Annotator & Evaluator at Outlier AI, I evaluated outputs from large language models such as GPT-4 and Claude. My role included annotating and labeling AI-generated responses using structured rubrics and providing feedback that informed model improvement. I contributed to adversarial prompt design and iterative RLHF workflows to enhance model reasoning, factual accuracy, and safety. • Reviewed and ranked LLM outputs for coherence, helpfulness, and alignment with user intent. • Designed and implemented adversarial prompts to test model performance across technical and general domains. • Applied domain expertise to identify hallucinations, bias, and inconsistencies in LLM responses. • Collaborated to set annotation standards and contributed to AI safety research documentation.

As an AI Trainer and Data Annotator & Evaluator at Outlier AI, I evaluated outputs from large language models such as GPT-4 and Claude. My role included annotating and labeling AI-generated responses using structured rubrics and providing feedback that informed model improvement. I contributed to adversarial prompt design and iterative RLHF workflows to enhance model reasoning, factual accuracy, and safety. • Reviewed and ranked LLM outputs for coherence, helpfulness, and alignment with user intent. • Designed and implemented adversarial prompts to test model performance across technical and general domains. • Applied domain expertise to identify hallucinations, bias, and inconsistencies in LLM responses. • Collaborated to set annotation standards and contributed to AI safety research documentation.

2024 - Present

Education

L

Late Dashrath Baba Madhyamik Vidhyalay Jawkheda

Higher Secondary Certificate, Science

Higher Secondary Certificate
2021 - 2021
J

Jay Bhavani Vidhya Mandir Devleghvan

Secondary School Certificate, General Education

Secondary School Certificate
2019 - 2019

Work History

A

Access Million

Data Analyst Intern

Pune
2024 - 2025