For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
R
Rajdeep Hazra

Rajdeep Hazra

AI Response Evaluator (Chatterly AI Project)

India flagHyderabad, India
$20.00/hrIntermediateCVATLabel StudioRoboflow

Key Skills

Software

CVATCVAT
Label StudioLabel Studio
RoboflowRoboflow
TolokaToloka

Top Subject Matter

LLM Evaluation
Developer Tooling
Code and Language Response Assessment

Top Data Types

AudioAudio
ImageImage
TextText

Top Task Types

ClassificationClassification
Evaluation/RatingEvaluation/Rating
Question AnsweringQuestion Answering
Text SummarizationText Summarization

Freelancer Overview

AI Response Evaluator (Chatterly AI Project). Brings 4+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Technology, Vellore Institute of Technology - Amaravati (2025) and Intermediate, Sri Chaitanya Junior College (2021). AI-training focus includes data types such as Computer Code and Programming and labeling workflows including Evaluation and Rating.

IntermediateEnglish

Labeling Experience

LLM Output Validator & Test Data Manager (Veassure/AI Full Stack Engineer)

Tested AI-generated code and performed rigorous validation of automated test cases produced by LLMs for API testing workflows. Assessed self-healing automation loops that utilized LLM feedback to regenerate and correct compilation errors in test scripts. Analyzed dynamic mock payload generation and backend validation utilities for robustness and accuracy. • Used orchestration scripts and OpenAPI/Swagger parsing for dynamic test generation. • Validated system behavior using human-in-the-loop assessment procedures. • Provided feedback on code correctness, logical flaws, and edge cases to QA teams. • Ensured adherence to schema constraints and enhanced developer usability through detailed commentary.

Tested AI-generated code and performed rigorous validation of automated test cases produced by LLMs for API testing workflows. Assessed self-healing automation loops that utilized LLM feedback to regenerate and correct compilation errors in test scripts. Analyzed dynamic mock payload generation and backend validation utilities for robustness and accuracy. • Used orchestration scripts and OpenAPI/Swagger parsing for dynamic test generation. • Validated system behavior using human-in-the-loop assessment procedures. • Provided feedback on code correctness, logical flaws, and edge cases to QA teams. • Ensured adherence to schema constraints and enhanced developer usability through detailed commentary.

2025 - Present

AI Response Evaluator (Chatterly AI Project)

Conducted qualitative and quantitative assessments of LLM-generated code and text responses across various programming and language tasks. Evaluated AI outputs for semantic accuracy, logical consistency, and functional correctness, providing detailed feedback for iterative model improvement. Benchmarked performance through direct comparison of multiple LLM models to identify error cases and hallucinations. • Used Sentence-BERT pipelines and human evaluation criteria to rank AI responses. • Logged prompts, model outputs, scoring metrics, and reviewer judgments to facilitate focused fine-tuning. • Provided structured developer feedback to enhance model usability and reliability. • Collaborated on the design and maintenance of data collection and evaluation processes.

Conducted qualitative and quantitative assessments of LLM-generated code and text responses across various programming and language tasks. Evaluated AI outputs for semantic accuracy, logical consistency, and functional correctness, providing detailed feedback for iterative model improvement. Benchmarked performance through direct comparison of multiple LLM models to identify error cases and hallucinations. • Used Sentence-BERT pipelines and human evaluation criteria to rank AI responses. • Logged prompts, model outputs, scoring metrics, and reviewer judgments to facilitate focused fine-tuning. • Provided structured developer feedback to enhance model usability and reliability. • Collaborated on the design and maintenance of data collection and evaluation processes.

2025 - Present

Education

V

Vellore Institute of Technology - Amaravati

Bachelor of Technology, Computer Science Engineering with Specialization in Artificial Intelligence and Machine Learning

Bachelor of Technology
2021 - 2025
S

Sri Chaitanya Junior College

Intermediate, Mathematics, Physics, and Chemistry

Intermediate
2019 - 2021

Work History

V

Vedya Labs

AI Full Stack Engineer

Hyderabad
2025 - Present
C

Capgemini

Industrial Operations Engineer Intern

Hyderabad
2025 - 2025