For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
A

Aniket Pandey

AI Training Engineer & RLHF Specialist

India flaggreater noida, India
$5.00/hrIntermediate

Key Skills

Software

No software listed

Top Subject Matter

Large Language Models
Language Model Evaluation
Computer Vision

Top Data Types

TextText
ImageImage
DocumentDocument

Top Task Types

RLHF
Classification

Freelancer Overview

AI Training Engineer & RLHF Specialist. Brings 2+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Technology, Noida Institute of Engineering & Technology (2025). AI-training focus includes data types such as Text and Image and labeling workflows including RLHF, Evaluation, and Rating.

IntermediateEnglishHindi

Labeling Experience

AI Training Engineer & RLHF Specialist

TextRLHF
Served as an AI Training Engineer responsible for the optimization and alignment of Large Language Models (LLMs) through structured evaluation. Applied reinforcement learning from human feedback (RLHF) methods to improve model accuracy, safety, and instruction alignment. Conducted comprehensive quality checks of model outputs for systemic bias, logical correctness, and adherence to guidelines. • Evaluated model responses for hallucinations, inaccuracies, and instruction-following errors. • Provided human feedback directly used for model RLHF fine-tuning cycles. • Collaborated with engineering teams to implement feedback into training pipelines. • Ensured consistent top-tier annotation quality surpassing team benchmarks.

Served as an AI Training Engineer responsible for the optimization and alignment of Large Language Models (LLMs) through structured evaluation. Applied reinforcement learning from human feedback (RLHF) methods to improve model accuracy, safety, and instruction alignment. Conducted comprehensive quality checks of model outputs for systemic bias, logical correctness, and adherence to guidelines. • Evaluated model responses for hallucinations, inaccuracies, and instruction-following errors. • Provided human feedback directly used for model RLHF fine-tuning cycles. • Collaborated with engineering teams to implement feedback into training pipelines. • Ensured consistent top-tier annotation quality surpassing team benchmarks.

2025 - Present

Image Classification Data Annotator

ImageClassification
Annotated 500+ image samples for computer vision (CV) training pipelines at Ethara.ai. Focused on identifying and classifying images to enhance model training accuracy and reduce noise. Maintained flawless batch quality with no critical errors or rejections. • Structured and cleaned annotation datasets for better model performance. • Applied strict compliance with annotation schemas across all tasks. • Contributed to a 22% reduction in downstream re-review workload. • Supported QA efforts to ensure delivery of high-quality annotated data.

Annotated 500+ image samples for computer vision (CV) training pipelines at Ethara.ai. Focused on identifying and classifying images to enhance model training accuracy and reduce noise. Maintained flawless batch quality with no critical errors or rejections. • Structured and cleaned annotation datasets for better model performance. • Applied strict compliance with annotation schemas across all tasks. • Contributed to a 22% reduction in downstream re-review workload. • Supported QA efforts to ensure delivery of high-quality annotated data.

2025 - 2025

LLM Evaluator & Data Annotation Expert

Text
Worked as an LLM Evaluator to grade over 1,200 LLM responses based on accuracy, coherence, instruction-following, and safety. Maintained a QA-verified annotation accuracy 6 points above the team average. Created error taxonomies and documented over 340 hallucinations and citation fabrications, informing new evaluation guidelines. • Restructured and QA'd raw annotation datasets for downstream improvements. • Flagged systemic prompt-response misalignments and triggered guideline revisions. • Onboarded quickly to multiple annotation schemas with zero violations. • Ensured zero critical-error batches returned throughout the engagement.

Worked as an LLM Evaluator to grade over 1,200 LLM responses based on accuracy, coherence, instruction-following, and safety. Maintained a QA-verified annotation accuracy 6 points above the team average. Created error taxonomies and documented over 340 hallucinations and citation fabrications, informing new evaluation guidelines. • Restructured and QA'd raw annotation datasets for downstream improvements. • Flagged systemic prompt-response misalignments and triggered guideline revisions. • Onboarded quickly to multiple annotation schemas with zero violations. • Ensured zero critical-error batches returned throughout the engagement.

2025 - 2025

Education

N

Noida Institute of Engineering & Technology

Bachelor of Technology, Computer Science and Engineering

Bachelor of Technology
2021 - 2025

Work History

W

wipro 11/

Project Engineer

Location not specified
2025 - Present