Shrikant Lahase - AI Systems Evaluation Analyst — LLMs & AI Agents

Key Skills

Software

Label Studio

Doccano

Labelbox

Internal/Proprietary Tooling

Top Subject Matter

General Knowledge & Education

Artificial Intelligence & Machine Learning, Computer Science & Programming

Engineering & Technology

Top Data Types

Text

Image

Document

Top Task Types

RLHF

Classification

Segmentation

Evaluation Rating

Red Teaming

Question Answering

Text Generation

Polygon

Bounding Box

Polyline

Cuboid

Object Detection

Fine Tuning

Freelancer Overview

I have experience working on large-scale AI training data evaluation through my role as an AI Systems Evaluation Analyst (RLHF) at Turing, where I assess the performance of browser-based AI agents capable of completing real-world tasks. My work involves performing side-by-side (SxS) comparisons of model outputs, evaluating reasoning quality, factual accuracy, task completion, and overall response relevance. I also conduct detailed fact-checking to detect hallucinations, logical inconsistencies, and reliability issues in large language model responses, providing structured feedback and written justifications that help improve model alignment and evaluation benchmarks. In addition to AI evaluation, I bring a strong technical background in software engineering and data systems, including experience building scalable data pipelines and processing large datasets using Python, SQL, and distributed tools like PySpark and Databricks. My research work involved processing 10+ TB of climate datasets and implementing analytical workflows for large-scale data analysis. This combination of analytical reasoning, technical expertise, and structured evaluation methodology enables me to contribute effectively to high-quality AI training data and model evaluation pipelines.

Entry LevelMarathiHindiEnglish

Labeling Experience

AI Systems Evaluation Analyst (RLHF) — Contractor

TextRLHF

As an AI Systems Evaluation Analyst (RLHF), I evaluated browser-based AI agents for reasoning quality, factual accuracy, and success in real-world user tasks. My work focused on structured side-by-side model comparisons, fact-checking, and detailed analysis to detect errors and performance gaps. I provided analytical feedback and justifications that contributed to improved AI model alignment and system benchmarks. • Assessed and rated LLM outputs for reliability and logical consistency. • Conducted structured evaluations including SxS comparisons and hallucination detection. • Generated analytical reports that informed benchmark and alignment improvements. • Utilized proprietary or internal evaluation tools provided by the client in a remote setting.

2026 - Present

Education

I

Indian Institute of Technology, Bhubaneswar

Bachelor of Technology, Civil Engineering

Bachelor of Technology

2021 - 2025

J

Jawahar Navodaya Vidyalaya, Shegaon

Higher Secondary Certificate, General Studies

Higher Secondary Certificate

2014 - 2021

Work History

T

Turing

Business Analyst

Buldhana

2026 - Present

P

PalTech Consulting

Associate Software Engineer

Hyderabad

2025 - Present