Rajdeep Hazra - AI Response Evaluator (Chatterly AI Project)

Key Skills

Software

CVAT

Label Studio

Roboflow

Toloka

Top Subject Matter

LLM Evaluation

Developer Tooling

Code and Language Response Assessment

Top Data Types

Audio

Image

Text

Top Task Types

Classification

Evaluation/Rating

Question Answering

Text Summarization

Freelancer Overview

AI Response Evaluator on the Chatterly AI Project with 4+ years of professional experience spanning complex workflows, research, and quality-focused execution. Core strengths include internal and proprietary tooling. Holds a Bachelor of Technology from Vellore Institute of Technology - Amaravati (2025) and an Intermediate qualification from Sri Chaitanya Junior College (2021). AI-training focus covers computer code and programming data, with labeling workflows centered on evaluation and rating.

Intermediate English

Labeling Experience

LLM Output Validator & Test Data Manager (Veassure/AI Full Stack Engineer)

Tested AI-generated code and performed rigorous validation of automated test cases produced by LLMs for API testing workflows. Assessed self-healing automation loops that utilized LLM feedback to regenerate and correct compilation errors in test scripts. Analyzed dynamic mock payload generation and backend validation utilities for robustness and accuracy.
• Used orchestration scripts and OpenAPI/Swagger parsing for dynamic test generation.
• Validated system behavior using human-in-the-loop assessment procedures.
• Provided feedback on code correctness, logical flaws, and edge cases to QA teams.
• Ensured adherence to schema constraints and enhanced developer usability through detailed commentary.

2025 - Present

AI Response Evaluator (Chatterly AI Project)

Conducted qualitative and quantitative assessments of LLM-generated code and text responses across a range of programming and language tasks. Evaluated AI outputs for semantic accuracy, logical consistency, and functional correctness, providing detailed feedback for iterative model improvement. Benchmarked performance by directly comparing multiple LLMs to identify error cases and hallucinations.
• Used Sentence-BERT pipelines and human evaluation criteria to rank AI responses.
• Logged prompts, model outputs, scoring metrics, and reviewer judgments to support targeted fine-tuning.
• Provided structured developer feedback to enhance model usability and reliability.
• Collaborated on the design and maintenance of data collection and evaluation processes.

2025 - Present

Education

Vellore Institute of Technology - Amaravati

Bachelor of Technology, Computer Science Engineering with Specialization in Artificial Intelligence and Machine Learning

2021 - 2025

Sri Chaitanya Junior College

Intermediate, Mathematics, Physics, and Chemistry

2019 - 2021

Work History

Vedya Labs

AI Full Stack Engineer

Hyderabad

2025 - Present

Capgemini

Industrial Operations Engineer Intern

Hyderabad

2025 - 2025