
Sachin Bind

LLM Data Evaluation and Benchmarking Intern

Nagpur, India
$8.00/hr, Intermediate, CrowdSource, AWS SageMaker, Appen

Key Skills

Software

CrowdSource
AWS SageMaker
Appen

Top Subject Matter

Large Language Models (LLMs)
Model Evaluation
Data-centric Machine Learning

Top Data Types

Text
Image

Top Task Types

Data Collection
Polygon
Fine-tuning
Computer Programming/Coding
Function Calling
Prompt + Response Writing (SFT)
Evaluation/Rating

Freelancer Overview

LLM Data Evaluation and Benchmarking Intern with 1+ years of professional experience across complex workflows, research, and quality-focused execution. Core strengths include internal and proprietary tooling. Education: Bachelor of Technology, G. H. Raisoni College of Engineering (2026). AI-training focus covers Text data and labeling workflows including Evaluation, Rating, and Data Collection.

English: Intermediate

Labeling Experience

ML Data Workflow and Curation Intern

Text, Data Collection
During my internship at Exoclass, I contributed to dataset preprocessing and quality-focused machine learning development, including data-centric workflows for validation, augmentation, and feature processing. I participated in gathering, cleansing, and preparing text data to improve model generalization and reduce overfitting in multi-modal AI pipelines. My tasks strengthened dataset reliability for both training and evaluation purposes.
• Orchestrated data collection and preprocessing for multi-modal transformer tasks
• Applied data validation and augmentation processes
• Enhanced feature engineering for improved downstream ML tasks
• Optimized datasets to improve training and evaluation quality


2025 - 2025

LLM Data Evaluation and Benchmarking Intern

Text
At UCN India, I contributed to structured data preparation, validation, and benchmarking workflows designed to support reliable LLM iteration and evaluation. My responsibilities included preparing text data for large language model training, implementing evaluation benchmarks to assess model factuality, and building pipelines for drift detection and latency tracing. This work enabled robust monitoring and improved model response quality in production settings.
• Designed and executed benchmark evaluations for LLM factuality and response quality
• Built data validation pipelines to ensure clean and accurate training data
• Implemented drift detection and observability tools to monitor performance changes
• Supported large-model experimentation via distributed training workflows


2025 - 2025

Education


G. H. Raisoni College of Engineering

Bachelor of Technology, Computer Science and Engineering

2022 - 2026

Work History


Exoclass

Software Engineer Intern

N/A
2025 - 2025

UCN India

Software Engineer Intern

Pune
2025 - 2025