For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Anurag Pathak

Anurag Pathak

Software Developer & Mechanical Engineer | Python, React, NLP

India flagDelhi, India
$30.00/hrEntry LevelLabelboxSnorkel AIScale AI

Key Skills

Software

LabelboxLabelbox
Snorkel AISnorkel AI
Scale AIScale AI

Top Subject Matter

Artificial Intelligence & Data Science
Software Engineering & Web Development
Mechanical Engineering & STEM

Top Data Types

TextText
Computer Code ProgrammingComputer Code Programming
DocumentDocument

Top Task Types

Text Summarization
Computer Programming Coding
Prompt Response Writing SFT
RLHF
Evaluation Rating

Freelancer Overview

I am a Software Developer and Mechanical Engineering graduate (B.Tech) specializing in Python, C++, and data analytics. My background allows me to tackle complex AI training tasks that require strict logical reasoning, mathematical modeling, and rigorous code evaluation. As an AI Research Intern at Suvidha Foundation, I gained direct experience in NLP data preparation by processing, structuring, and evaluating over 1,100 articles for text summarization datasets. I am highly proficient in cleaning complex datasets with Pandas, verifying LLM outputs for factual and logical accuracy, and integrating AI models like Hugging Face's BART. I am detail-oriented and excel at providing high-fidelity data for coding, STEM reasoning, and text-based AI training.

Entry LevelEnglishHindi

Labeling Experience

AI Research Intern—NLP Data Preparation and Summarization

OtherTextText Summarization
During my AI Research Internship at Suvidha Foundation, I compiled and processed over 1,100 news articles to support NLP summarization research using Python. I handled the annotation, cleaning, and preparation of textual data for training AI models on the CNN/DailyMail datasets. My work included developing preprocessing pipelines using Pandas and NumPy, and rigorously evaluating model outputs for factual accuracy and logical consistency. • Annotated and structured large volumes of text data specifically for NLP applications. • Validated LLM outputs, identifying hallucinations and logical errors in text summaries. • Executed data cleaning and transformation using Python libraries (Pandas). • Documented data processes to ensure reproducibility for research reporting.

During my AI Research Internship at Suvidha Foundation, I compiled and processed over 1,100 news articles to support NLP summarization research using Python. I handled the annotation, cleaning, and preparation of textual data for training AI models on the CNN/DailyMail datasets. My work included developing preprocessing pipelines using Pandas and NumPy, and rigorously evaluating model outputs for factual accuracy and logical consistency. • Annotated and structured large volumes of text data specifically for NLP applications. • Validated LLM outputs, identifying hallucinations and logical errors in text summaries. • Executed data cleaning and transformation using Python libraries (Pandas). • Documented data processes to ensure reproducibility for research reporting.

2025 - 2025

Education

M

Maharaja Agrasen Institute of Technology

Bachelor of Technology, Mechanical Engineering

Bachelor of Technology
2021 - 2025

Work History

S

Suvidha Foundation

AI Research Intern

Delhi
2025 - 2025
D

Deloitte

Data Analytics Intern

Delhi
2024 - 2024