Start Up

LLM Fine-Tuning and Data Labeling Lead

Delhi, India
$40.00/hr, Intermediate, AWS SageMaker, Anno-Mage, Appen

Key Skills

Software

AWS SageMaker
Anno-Mage
Appen
Argilla
Axiom AI
Clickworker
CloudFactory
CrowdFlower
CrowdSource
CVAT
Data Annotation Tech
Dataloop
Datatroniq
Datumbox
Datasaur
Datature
Dataturk
Deep Systems
Diffgram
Doccano
Encord
Figure Eight
Google Cloud Vertex AI
Hasty
HiveMind
Humanatic
iMerit
Img Lab
Kili Technology
Labelbox
LabelImg
Label Studio
LightTag
Lionbridge
Micro1
Mercor
Mighty AI
Mindrift
OneForma
OpenCV AI Kit (OAK)
Playment
Prodigy
Redbrick AI
Remotasks
Roboflow
Sama
Scale AI
Sloth
Snorkel AI
SuperAnnotate
Supervisely
Surge AI
Tagtog
Toloka
Telus
Trilldata Technologies
VoTT
V7 Labs

Top Subject Matter

Large Language Models
Enterprise Domain Texts
Document Image Classification

Top Data Types

Text
Image
Document

Top Task Types

Fine-tuning
Classification
Bounding Box
Polygon
Segmentation
Entity (NER) Classification
Point/Key Point
Polyline
Cuboid
Object Detection
Text Generation
Question Answering
Text Summarization
RLHF
Red Teaming
Transcription
Evaluation/Rating
Computer Programming/Coding
Data Collection
Function Calling
Prompt + Response Writing (SFT)

Freelancer Overview

LLM Fine-Tuning and Data Labeling Lead with 9+ years of professional experience spanning complex workflows, research, and quality-focused execution. Core strengths include MLflow and Gradio. Holds a Master of Science (2020) and a Bachelor of Science (2018), both from Himalayan University. AI-training focus covers Text and Image data and labeling workflows such as Fine-tuning and Classification.

English: Intermediate

Labeling Experience

Vision+LLM Multi-modal Labeling Specialist

Image, Classification
Developed image classification pipelines using EfficientNet-B3 for custom datasets of 2,000 labeled images deployed for document understanding and visual question answering. Managed manual and automated labeling workflows to reach target accuracy and reduce review time. Used Gradio for interactive validation of model predictions and label consistency.
• Supervised manual and semi-automated image labeling tasks
• Coordinated workflow across vision and document QA domains
• Ensured dataset quality for transfer learning applications
• Achieved review time savings of 20 hours per week

2024 - Present

LLM Fine-Tuning and Data Labeling Lead

Text, Fine-tuning
Fine-tuned LLaMA 2 and Mistral-7B language models on domain-specific text using LoRA/QLoRA (PEFT) methods. Labeled and curated training datasets to improve task accuracy and reduce annotation costs across multiple iterations. Evaluated output quality and maintained experiment tracking via MLflow.
• Used domain-specific and generic datasets for LLM tuning
• Implemented annotation guidelines and QA procedures
• Integrated labeled data pipelines with MLflow for reproducibility
• Improved annotation efficiency, lowering costs by up to 50%

2024 - Present

Education

Himalayan University

Master of Science, Computer Science

2020 - 2020

Himalayan University

Bachelor of Science, Computer Science

2018 - 2018

Work History

LRS Services

Software Developer & AI/ML Engineer

Delhi
2024 - Present

Prakhar Software Solutions

Software Developer

Delhi
2022 - 2024