Start Up - LLM Fine-Tuning and Data Labeling Lead

Key Skills

Software

AWS SageMaker

Anno-Mage

Appen

Argilla

Axiom AI

Clickworker

CloudFactory

CrowdFlower

CrowdSource

CVAT

Data Annotation Tech

Dataloop

Datatroniq

Datumbox

Datasaur

Datature

Dataturk

Deep Systems

Diffgram

Doccano

Encord

Figure Eight

Google Cloud Vertex AI

Hasty

HiveMind

Humanatic

iMerit

Img Lab

Kili Technology

Labelbox

LabelImg

Label Studio

LightTag

Lionbridge

Micro1

Mercor

Mighty AI

Mindrift

OneForma

OpenCV AI Kit (OAK)

Playment

Prodigy

Redbrick AI

Remotasks

Roboflow

Sama

Scale AI

Sloth

Snorkel AI

SuperAnnotate

Supervisely

Surge AI

Tagtog

Toloka

Telus

Trilldata Technologies

VoTT

V7 Labs

Top Subject Matter

Large Language Models

Enterprise Domain Texts

Document Image Classification

Top Data Types

Text

Image

Document

Top Task Types

Fine-tuning

Classification

Bounding Box

Polygon

Segmentation

Entity (NER) Classification

Point/Key Point

Polyline

Cuboid

Object Detection

Text Generation

Question Answering

Text Summarization

RLHF

Red Teaming

Transcription

Evaluation/Rating

Computer Programming/Coding

Data Collection

Function Calling

Prompt + Response Writing (SFT)

Freelancer Overview

LLM Fine-Tuning and Data Labeling Lead with 9+ years of professional experience spanning complex workflows, research, and quality-focused execution. Core tools include MLflow and Gradio. Holds a Master of Science (2020) and a Bachelor of Science (2018) from Himalayan University. AI-training work covers text and image data, with labeling workflows that include fine-tuning and classification.

Intermediate English

Labeling Experience

Vision+LLM Multi-modal Labeling Specialist

Image

Classification

Developed image classification pipelines using EfficientNet-B3 for custom datasets of 2,000 labeled images, deployed for document understanding and visual question answering. Managed manual and automated labeling workflows to reach target accuracy and reduce review time. Used Gradio for interactive validation of model predictions and label consistency.

• Supervised manual and semi-automated image labeling tasks
• Coordinated workflow across vision and document QA domains
• Ensured dataset quality for transfer learning applications
• Achieved review time savings of 20 hours per week

2024 - Present

LLM Fine-Tuning and Data Labeling Lead

Text

Fine-tuning

Fine-tuned LLaMA 2 and Mistral-7B language models on domain-specific text using LoRA/QLoRA (PEFT) methods. Labeled and curated training datasets to improve task accuracy and reduce annotation costs across multiple iterations. Evaluated output quality and tracked experiments in MLflow.

• Used domain-specific and generic datasets for LLM tuning
• Implemented annotation guidelines and QA procedures
• Integrated labeled data pipelines with MLflow for reproducibility
• Improved annotation efficiency, lowering costs by up to 50%

2024 - Present

Education

Himalayan University

Master of Science, Computer Science

2020

Himalayan University

Bachelor of Science, Computer Science

2018

Work History

LRS Services

Software Developer & AI/ML Engineer

Delhi

2024 - Present

Prakhar Software Solutions

Software Developer

Delhi

2022 - 2024