Sakinala Sri Sai Pawan

AI Model Evaluator & RLHF Trainer

Hyderabad, India
$12.00/hr · Entry Level · Other

Key Skills

Software

Other

Top Subject Matter

AI model evaluation and training
Artificial Intelligence (AI) & Machine Learning (ML)
Generative AI (LLMs)

Top Data Types

Text
Image
Audio

Top Task Types

RLHF
Evaluation/Rating
Computer Programming/Coding

Freelancer Overview

AI Model Evaluator & RLHF Trainer with experience evaluating AI model outputs, improving data quality, and supporting reinforcement learning from human feedback (RLHF) workflows. Focused on accurate annotation and consistency in text-based datasets. Key strengths include analytical thinking, attention to detail, quality assurance, and structured evaluation of model responses across a variety of scenarios.

Education: Master of Technology (M.Tech), Osmania University (2025); Bachelor of Technology (B.Tech), KL University (2025).

AI Training Know-How: Familiar with handling text data and supporting labeling workflows, including RLHF-based model evaluation and feedback workflows.

Entry Level · English · Telugu · Hindi

Labeling Experience

AI Model Evaluator & RLHF Trainer

Other · Text · RLHF

As an AI Model Evaluator & RLHF Trainer at Deccan AI, I participated in reinforcement learning from human feedback (RLHF) pipelines to enhance AI model quality. My daily tasks included evaluating model outputs for performance, accuracy, and reasoning, and delivering comprehensive feedback. I consistently identified factual errors, inconsistencies, and poor reasoning within AI-generated responses.
• Compared multiple model-generated responses and selected the best output based on defined criteria.
• Collaborated with the AI development team to refine prompt guidelines and evaluation rubrics.
• Contributed to data annotation processes that supported continuous AI improvement.
• Ensured high labeling quality by adhering to standardized evaluation protocols.

2025 - Present

Education

KL University

Bachelor of Technology, Computer Science (Artificial Intelligence and Intelligent Process Automation)

2021 - 2025
Osmania University

Master of Technology, Artificial Intelligence and Machine Learning

2025

Work History

Deccan

AI Model Evaluator & RLHF Trainer

Hyderabad
2025 - Present