For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
S

Sonu Kumar

LLM Fine-Tuning/Text Summarization Project/AI agent building

INDIA flag
Delhi, India
$10.00/hrIntermediateOtherAws SagemakerData Annotation Tech

Key Skills

Software

Other
AWS SageMakerAWS SageMaker
Data Annotation TechData Annotation Tech

Top Subject Matter

Recipe Text Summarization
Relationship Chatbot for Sentence Suggestion based on Conversation Stage
AI Search Engine Chatbot Using Tavily and Qwen Model

Top Data Types

TextText
ImageImage
AudioAudio

Top Task Types

Text Summarization
Segmentation
Entity Ner Classification
Text Generation
Fine Tuning
RLHF
Computer Programming Coding
Evaluation Rating
Transcription
Function Calling
Prompt Response Writing SFT
Object Detection
Data Collection

Freelancer Overview

I have experience working closely with AI training data through building and evaluating machine learning and LLM-based systems. In my projects, I’ve handled data preprocessing, cleaning, and structuring large datasets (such as 14K+ records for text summarization), ensuring high-quality inputs for model training. I’ve also worked on improving model outputs by refining prompts, reducing hallucinations, and evaluating responses based on accuracy and relevance skills that directly relate to high-quality data labeling and annotation practices. Additionally, I’ve developed real-time AI systems and multi-agent pipelines where consistent and well-structured data was critical for performance. My work involved analyzing outputs, identifying edge cases, and iteratively improving system behavior, which aligns closely with training data validation and quality control. What sets me apart is my strong understanding of how data quality impacts model performance, allowing me to approach labeling and AI training tasks with both precision and practical insight.

IntermediateEnglish

Labeling Experience

Data annotation

ImageClassification
at RWS Compay i have worked on image classification and missing part identification

at RWS Compay i have worked on image classification and missing part identification

2025 - 2025

LLM Fine-Tuning/Text Summarization Project

OtherTextText Summarization
I processed a dataset of over 14,000 recipe texts and fine-tuned a LLaMA-based sequence-to-sequence model for text summarization. My work involved preparing data through comprehensive preprocessing, including tokenization and evaluation with ROUGE metrics. I designed and evaluated the summarization pipeline for optimal performance using standard benchmarks. • Conducted web scraping and data collection for text data extraction. • Preprocessed and tokenized large volumes of recipe text data for model input. • Fine-tuned a large language model specifically for text summarization tasks. • Assessed model performance using ROUGE-1 metric, achieving a score of 52.70.

I processed a dataset of over 14,000 recipe texts and fine-tuned a LLaMA-based sequence-to-sequence model for text summarization. My work involved preparing data through comprehensive preprocessing, including tokenization and evaluation with ROUGE metrics. I designed and evaluated the summarization pipeline for optimal performance using standard benchmarks. • Conducted web scraping and data collection for text data extraction. • Preprocessed and tokenized large volumes of recipe text data for model input. • Fine-tuned a large language model specifically for text summarization tasks. • Assessed model performance using ROUGE-1 metric, achieving a score of 52.70.

2024 - 2024

Education

I

Indraprastha Institute of Information Technology, Delhi

Master of Technology, Computer Science and Engineering

Master of Technology
2023 - 2025
S

Shershah Engineering College, Sasaram

Bachelor of Technology, Computer Science and Engineering

Bachelor of Technology
2019 - 2023

Work History

S

Sigmatic.ai

ML Engineer

Delhi
2025 - Present