For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
A

Anurag Singh

Data Analyst

India flagGurgaon, India
$20.00/hrEntry LevelClickworkerHivemindLabelbox

Key Skills

Software

ClickworkerClickworker
HiveMindHiveMind
LabelboxLabelbox
RemotasksRemotasks
Other

Top Subject Matter

Document AI
Intelligent Retrieval
Ocr Domain Expertise

Top Data Types

DocumentDocument
TextText
ImageImage

Top Task Types

Text GenerationText Generation
ClassificationClassification

Freelancer Overview

Data Analyst. Brings 3+ years of professional experience across complex professional workflows, research, and quality-focused execution.

Entry LevelEnglishHindi

Labeling Experience

AI Spreadsheet Classifier & Annotator

TextClassification
Developed spreadsheet validation and anomaly detection platforms using offline LLMs to assist in AI-aided data review and annotation. Built pipelines for automated data classification, anomaly detection, and data transformation in large enterprise spreadsheets. Enabled users to interact with AI for data labeling and corrections in real time. • Implemented classification logic with local LLMs for AI-assisted spreadsheet annotation. • Automated anomaly detection and suspicious entry identification with ML pipelines. • Provided real-time preview and validation of annotated or cleaned datasets. • Supported prompt engineering workflows for question answering on tabular data.

Developed spreadsheet validation and anomaly detection platforms using offline LLMs to assist in AI-aided data review and annotation. Built pipelines for automated data classification, anomaly detection, and data transformation in large enterprise spreadsheets. Enabled users to interact with AI for data labeling and corrections in real time. • Implemented classification logic with local LLMs for AI-assisted spreadsheet annotation. • Automated anomaly detection and suspicious entry identification with ML pipelines. • Provided real-time preview and validation of annotated or cleaned datasets. • Supported prompt engineering workflows for question answering on tabular data.

2024 - Present

AI Document Processing & Retrieval System Developer

DocumentText Generation
Developed and deployed AI-powered document processing and RAG platforms focused on privacy-first document analysis. Led the design and rollout of systems incorporating local LLMs, OCR, PDF chunking, and embedding generation for enterprise clients. Architected platforms that ingested, transformed, and summarized documents for intelligent retrieval and AI chatbot integration. • Built end-to-end RAG pipelines with document chunking and embedding generation for information retrieval. • Integrated OCR (Google Cloud Vision, Tesseract) for document digitization and text labeling tasks. • Engineered multilingual, voice-enabled chatbot interfaces for document question answering tasks. • Designed systems for user feedback collection and AI-assisted text transformation workflows.

Developed and deployed AI-powered document processing and RAG platforms focused on privacy-first document analysis. Led the design and rollout of systems incorporating local LLMs, OCR, PDF chunking, and embedding generation for enterprise clients. Architected platforms that ingested, transformed, and summarized documents for intelligent retrieval and AI chatbot integration. • Built end-to-end RAG pipelines with document chunking and embedding generation for information retrieval. • Integrated OCR (Google Cloud Vision, Tesseract) for document digitization and text labeling tasks. • Engineered multilingual, voice-enabled chatbot interfaces for document question answering tasks. • Designed systems for user feedback collection and AI-assisted text transformation workflows.

2024 - Present

AI Data Trainer / RLHF Contributor | Outlier AI (Remote)

TextQuestion Answering
Contributed to large-scale LLM training and evaluation pipelines across multiple RLHF and evaluation projects Performed response ranking, pairwise comparison, and justification writing to improve model alignment and output quality Designed and generated high-quality prompts (coding + multilingual + safety) to test model reasoning and robustness Executed rewrite tasks to enhance correctness, coherence, and factual accuracy of AI-generated responses Worked on coding-focused RLHF tasks, including prompt engineering and evaluation of code generation outputs Participated in multilingual evaluation (English, Hindi) ensuring localization and contextual relevance Evaluated model outputs on safety, factuality, and edge-case handling, including sensitive content scenarios Contributed to tool-usage and agent evaluation workflows, assessing model interaction with APIs and structured tools Maintained high-quality standards under strict evaluation guidelines and scoring rubrics

Contributed to large-scale LLM training and evaluation pipelines across multiple RLHF and evaluation projects Performed response ranking, pairwise comparison, and justification writing to improve model alignment and output quality Designed and generated high-quality prompts (coding + multilingual + safety) to test model reasoning and robustness Executed rewrite tasks to enhance correctness, coherence, and factual accuracy of AI-generated responses Worked on coding-focused RLHF tasks, including prompt engineering and evaluation of code generation outputs Participated in multilingual evaluation (English, Hindi) ensuring localization and contextual relevance Evaluated model outputs on safety, factuality, and edge-case handling, including sensitive content scenarios Contributed to tool-usage and agent evaluation workflows, assessing model interaction with APIs and structured tools Maintained high-quality standards under strict evaluation guidelines and scoring rubrics

2024 - 2025

Education

G

Galgotias College of Engineering & Technology

Master of Computer Applications, Computer Applications

Master of Computer Applications
2021 - 2023
S

Singhania University

Bachelor of Computer Applications, Computer Applications

Bachelor of Computer Applications
2018 - 2021

Work History

T

TRPW Strategic Partners

Data Analyst

Gurgaon
2024 - Present
O

Outlier AI

AI Data Trainer / RLHF Contributor

Gurgaon
2024 - 2025