Manju Devi

LLM-Assisted Product Data Enrichment Pipeline

India
Expert, Other

Key Skills

Software

Other

Top Subject Matter

E-commerce Product Data
General Machine Learning Training Data

Top Data Types

Text

Top Task Types

Classification

Freelancer Overview

Builds LLM-assisted product data enrichment pipelines. Brings 9+ years of professional experience across complex professional workflows, research, and quality-focused execution. Education: Bachelor of Science in Computer Science / Information Technology (2018). AI-training focus: Text data and Classification labeling workflows.

Expert

Labeling Experience

Freelance AI Data Labeling and Validation (as part of Python Developer & Web Scraping Specialist role)

Other, Text, Classification
Developed LLM-assisted pipelines to clean, classify, and validate large-scale datasets used for AI/ML training. Designed post-processing logic for prompt engineering tasks aimed at improving model input quality. Performed manual reviews on automatically labeled data to ensure adherence to training requirements.
• Used OpenRouter API and ChatGPT for enhancing labeled data accuracy.
• Optimized annotation pipelines for efficiency and quality.
• Delivered validated datasets in various formats after multi-stage processing.
• Provided quality reports alongside labeled data for client verification.
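The post-processing and quality-report steps described above could look roughly like the following. This is a minimal sketch, not the pipeline actually used: the label set, `normalize_label`, and `quality_report` are hypothetical names chosen for illustration, and the profile does not state the real categories or report format.

```python
from __future__ import annotations

from collections import Counter

# Hypothetical label set; the profile does not list the real categories.
ALLOWED_LABELS = {"electronics", "apparel", "home", "other"}


def normalize_label(raw: str) -> str | None:
    """Clean a raw LLM reply and return it only if it is an allowed label."""
    # Drop surrounding whitespace, then stray quotes or a trailing period.
    cleaned = raw.strip().strip("\"'.").lower()
    return cleaned if cleaned in ALLOWED_LABELS else None


def quality_report(raw_labels: list[str]) -> dict:
    """Summarize how many raw replies survived normalization."""
    normalized = [normalize_label(r) for r in raw_labels]
    valid = [n for n in normalized if n is not None]
    return {
        "total": len(raw_labels),
        "valid": len(valid),
        "rejected": len(raw_labels) - len(valid),
        "counts": dict(Counter(valid)),
    }
```

Rejected replies are counted rather than silently dropped, so a reviewer can see how much of a batch still needs manual attention.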

2019 - Present

LLM-Assisted Product Data Enrichment Pipeline

Other, Text, Classification
Integrated OpenRouter API (Claude + GPT-4o) to auto-classify and standardize product data for ML model training. Applied validation logic to cross-check LLM-generated labels, ensuring data reliability for supervised learning. Flagged ambiguous or low-confidence outputs for human review and further curation.
• Labeled and cleaned 25,000 product records for training data pipelines.
• Applied prompt engineering to instruct LLMs on classification and data normalization.
• Executed cross-checks to maintain quality standards in the annotated dataset.
• Reduced manual data cleaning workload by incorporating automated checks.
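One way to picture the cross-check and review-flagging described above is a small agreement rule between two model outputs. The sketch below assumes each model returns a label plus a self-reported confidence score; the `Prediction` type, the `cross_check` function, and the 0.8 threshold are illustrative assumptions, not details taken from the project.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Prediction:
    label: str
    confidence: float  # assumed self-reported score in [0, 1]


def cross_check(a: Prediction, b: Prediction, min_conf: float = 0.8) -> dict:
    """Accept a label only when both models agree and both are confident."""
    if a.label != b.label:
        # The models disagree: no label is trusted, send to a human.
        return {"label": None, "needs_review": True, "reason": "disagreement"}
    if min(a.confidence, b.confidence) < min_conf:
        # The models agree, but at least one is unsure: keep the label, flag it.
        return {"label": a.label, "needs_review": True, "reason": "low_confidence"}
    return {"label": a.label, "needs_review": False, "reason": None}
```

Under this rule only records where both models agree with high confidence skip manual review, which is one plausible way automated checks could reduce the manual cleaning workload.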

2024

Education

N/A

Bachelor of Science, Computer Science / Information Technology

2018

Work History

Self-Employed

Python Developer & Freelance Web Scraping Specialist

N/A
2019 - Present

IT Services Company

Junior Software Developer

N/A
2018 - 2019