For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
S

Shrikant Lahase

AI Systems Evaluation Analyst — LLMs & AI Agents

INDIA flag
Dhamangaon Badhe, India
$15.00/hrEntry LevelLabel StudioDoccanoLabelbox

Key Skills

Software

Label StudioLabel Studio
DoccanoDoccano
LabelboxLabelbox
Internal/Proprietary Tooling

Top Subject Matter

General Knowledge & Education
Artificial Intelligence & Machine Learning, Computer Science & Programming
Engineering & Technology

Top Data Types

TextText
ImageImage
DocumentDocument

Top Task Types

RLHF
Classification
Segmentation
Evaluation Rating
Red Teaming
Question Answering
Text Generation
Polygon
Bounding Box
Polyline
Cuboid
Object Detection
Fine Tuning

Freelancer Overview

I have experience working on large-scale AI training data evaluation through my role as an AI Systems Evaluation Analyst (RLHF) at Turing, where I assess the performance of browser-based AI agents capable of completing real-world tasks. My work involves performing side-by-side (SxS) comparisons of model outputs, evaluating reasoning quality, factual accuracy, task completion, and overall response relevance. I also conduct detailed fact-checking to detect hallucinations, logical inconsistencies, and reliability issues in large language model responses, providing structured feedback and written justifications that help improve model alignment and evaluation benchmarks. In addition to AI evaluation, I bring a strong technical background in software engineering and data systems, including experience building scalable data pipelines and processing large datasets using Python, SQL, and distributed tools like PySpark and Databricks. My research work involved processing 10+ TB of climate datasets and implementing analytical workflows for large-scale data analysis. This combination of analytical reasoning, technical expertise, and structured evaluation methodology enables me to contribute effectively to high-quality AI training data and model evaluation pipelines.

Entry LevelMarathiHindiEnglish

Labeling Experience

AI Systems Evaluation Analyst (RLHF) — Contractor

TextRLHF
As an AI Systems Evaluation Analyst (RLHF), I evaluated browser-based AI agents for reasoning quality, factual accuracy, and success in real-world user tasks. My work focused on structured side-by-side model comparisons, fact-checking, and detailed analysis to detect errors and performance gaps. I provided analytical feedback and justifications that contributed to improved AI model alignment and system benchmarks. • Assessed and rated LLM outputs for reliability and logical consistency. • Conducted structured evaluations including SxS comparisons and hallucination detection. • Generated analytical reports that informed benchmark and alignment improvements. • Utilized proprietary or internal evaluation tools provided by the client in a remote setting.

As an AI Systems Evaluation Analyst (RLHF), I evaluated browser-based AI agents for reasoning quality, factual accuracy, and success in real-world user tasks. My work focused on structured side-by-side model comparisons, fact-checking, and detailed analysis to detect errors and performance gaps. I provided analytical feedback and justifications that contributed to improved AI model alignment and system benchmarks. • Assessed and rated LLM outputs for reliability and logical consistency. • Conducted structured evaluations including SxS comparisons and hallucination detection. • Generated analytical reports that informed benchmark and alignment improvements. • Utilized proprietary or internal evaluation tools provided by the client in a remote setting.

2026 - Present

Education

I

Indian Institute of Technology, Bhubaneswar

Bachelor of Technology, Civil Engineering

Bachelor of Technology
2021 - 2025
J

Jawahar Navodaya Vidyalaya, Shegaon

Higher Secondary Certificate, General Studies

Higher Secondary Certificate
2014 - 2021

Work History

T

Turing

Business Analyst

Buldhana
2026 - Present
P

PalTech Consulting

Associate Software Engineer

Hyderabad
2025 - Present