AI Training Engineer & RLHF Specialist
Served as an AI Training Engineer responsible for optimizing and aligning Large Language Models (LLMs) through structured evaluation. Applied reinforcement learning from human feedback (RLHF) methods to improve model accuracy, safety, and instruction alignment. Conducted comprehensive quality checks of model outputs for systemic bias, logical correctness, and adherence to guidelines.

• Evaluated model responses for hallucinations, factual inaccuracies, and instruction-following errors.
• Provided structured human feedback used directly in RLHF fine-tuning cycles.
• Collaborated with engineering teams to incorporate evaluation feedback into training pipelines.
• Consistently delivered annotation quality that exceeded team benchmarks.