For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
S

Shobhina Varshney

Data Annotator (Freelancer)

INDIA flag
Hyderabad, India
$40.00/hrIntermediateOther

Key Skills

Software

Other

Top Subject Matter

Programming/LLM Optimization
Generative AI/LLM Knowledge Systems
AI Data Engineering

Top Data Types

TextText

Top Task Types

Fine Tuning

Freelancer Overview

Data Annotator (Freelancer). Brings 8+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal, Proprietary Tooling, and Other. Education includes Bachelor of Technology, Gyan Mahavidyalaya and Master of Business Administration, Institute of Information Management & Technology. AI-training focus includes data types such as Computer Code, Programming, and Text and labeling workflows including Computer Programming, Coding, and Fine-tuning.

IntermediateEnglish

Labeling Experience

AI Data Curation & Pipeline Architect

TextFine Tuning
Helped architect and curate datasets for generative AI system development and optimization. Implemented a RAG pipeline and curation workflow to ensure data quality and effective model responses. Analyzed and logged token usage and system performance for AI training results. • Engineered and maintained vector database-backed semantic retrieval workflows for LLM training. • Curated and prioritized training samples using Python (Pandas) for dataset refinement. • Leveraged tools like LangChain, Pinecone, FAISS, and internal scripting for preprocessing. • Focused on data pipeline optimization for rapid and accurate AI model tuning.

Helped architect and curate datasets for generative AI system development and optimization. Implemented a RAG pipeline and curation workflow to ensure data quality and effective model responses. Analyzed and logged token usage and system performance for AI training results. • Engineered and maintained vector database-backed semantic retrieval workflows for LLM training. • Curated and prioritized training samples using Python (Pandas) for dataset refinement. • Leveraged tools like LangChain, Pinecone, FAISS, and internal scripting for preprocessing. • Focused on data pipeline optimization for rapid and accurate AI model tuning.

2024 - Present

Data Annotator (Freelancer)

Evaluated and annotated diverse code snippets in multiple programming languages to ensure accuracy, efficiency, and clarity. Applied structured labeling guidelines and detailed feedback to improve dataset consistency for AI training. Researched and implemented preprocessing steps for formatting data to fine-tune large language models (LLMs). • Reviewed and annotated code in Python, Java, and C++ for correctness and optimization. • Provided structured logic corrections and documentation clarifications for improved AI performance. • Utilized internal/proprietary tooling for managing and tracking data labeling work. • Contributed to data quality initiatives critical for LLM semantic understanding and model fine-tuning.

Evaluated and annotated diverse code snippets in multiple programming languages to ensure accuracy, efficiency, and clarity. Applied structured labeling guidelines and detailed feedback to improve dataset consistency for AI training. Researched and implemented preprocessing steps for formatting data to fine-tune large language models (LLMs). • Reviewed and annotated code in Python, Java, and C++ for correctness and optimization. • Provided structured logic corrections and documentation clarifications for improved AI performance. • Utilized internal/proprietary tooling for managing and tracking data labeling work. • Contributed to data quality initiatives critical for LLM semantic understanding and model fine-tuning.

2025 - 2025

Data Engineer (AI/ML Pipeline)

OtherTextFine Tuning
Experimented with preparing datasets in LLM-ready annotation formats using Hugging Face Transformers. Aggregated, cleaned, and transformed data using Pandas and NumPy for model training. Served processed data to downstream applications, enabling actionable business insights and efficient AI pipeline integration. • Designed data flows optimized for AI/ML consumption in a commercial environment. • Prototyped fine-tuning data structures using Python and open-source libraries. • Ensured data consistency and high quality across diverse sources. • Contributed to AI project deployment by transforming and annotating text datasets.

Experimented with preparing datasets in LLM-ready annotation formats using Hugging Face Transformers. Aggregated, cleaned, and transformed data using Pandas and NumPy for model training. Served processed data to downstream applications, enabling actionable business insights and efficient AI pipeline integration. • Designed data flows optimized for AI/ML consumption in a commercial environment. • Prototyped fine-tuning data structures using Python and open-source libraries. • Ensured data consistency and high quality across diverse sources. • Contributed to AI project deployment by transforming and annotating text datasets.

2024 - 2024

Education

I

Institute of Information Management & Technology

Master of Business Administration, Human Resources

Master of Business Administration
Not specified
G

Gyan Mahavidyalaya

Bachelor of Technology, Computer Science Engineering (Artificial Intelligence and Machine Learning)

Bachelor of Technology
Not specified

Work History

A

Amazon

Full Stack Engineer

Bangalore
2024 - Present
A

Amazon

Data Engineer

Bangalore
2024 - 2024