Waqar Ahmed

Advanced AI Data Trainer

Lahore, Pakistan
$15.00/hr, Expert, Appen, Clickworker, Crowdsource

Key Skills

Software

Appen
Clickworker
CrowdSource
Data Annotation Tech
Google Cloud Vertex AI
Labelbox
Label Studio
Other

Top Subject Matter

No subject matter listed

Top Data Types

Audio
Image
Text

Top Task Types

Bounding Box
Data Collection
Fine Tuning
Prompt Response Writing SFT
RLHF

Freelancer Overview

I am an experienced AI Data Trainer and Data Labeling Specialist with four years of experience training, evaluating, and optimizing AI models, including Large Language Models (LLMs). I have worked extensively on data annotation, model ranking, response evaluation, and AI safety testing to help machine learning models generate accurate, contextually relevant, high-quality outputs. My role involved fine-tuning AI behavior, detecting hallucinations, transcribing multilingual content, and categorizing data for AI training, which has made me highly skilled in reinforcement learning from human feedback (RLHF), supervised fine-tuning (SFT), and model validation.

Beyond AI training, I bring a strong background in customer service operations and process optimization, having worked on large-scale projects with companies such as Uber Eats, DoorDash, and Foodpanda. I have led data-driven projects, quality audits, and AI-assisted content curation, ensuring smooth data enrichment and model improvement. My technical proficiency includes tools such as Salesforce, Zendesk, Jira, and other data management platforms, allowing me to bridge the gap between AI training and real-world applications.

As a native Urdu and Punjabi speaker with advanced English proficiency, I am well-equipped for multilingual AI training and linguistic model enhancement.

Expert, Urdu, Arabic, English, Punjabi

Labeling Experience

Trajectory Based Work

Other, Video, Text Generation, RLHF
I have actively contributed to trajectory-based AI training as an OpenAI Operator, focusing on refining Large Language Models (LLMs) through reinforcement learning and supervised fine-tuning (SFT). My work involved guiding AI behavior by generating high-quality training trajectories so that models learn patterns, context, and reasoning in a structured manner.

Key tasks included:
Creating and curating training dialogues to teach AI models natural, human-like interactions.
Evaluating model responses and providing ranked feedback to improve coherence, factual accuracy, and ethical alignment.
Identifying inconsistencies or biases in AI-generated text and refining datasets to enhance performance.
Fine-tuning AI responses for improved decision-making by adjusting reward signals and reinforcement learning parameters.

2023 - 2024

Data Annotation & Model Ranking

Other, Text, RLHF, Fine Tuning
Evaluating and ranking AI-generated responses for quality, factual accuracy, and contextual appropriateness.

2022 - 2023
CrowdSource

Text Categorization & Sentiment Analysis

Crowdsource, Text, Bounding Box
Labeling and classifying textual data based on sentiment, intent, and relevance for AI model training.

2022 - 2022

Education

Virtual University of Pakistan

MBA, Business Administration
2007 - 2010

Work History

Invisible Technologies

Advanced AI Data Trainer

New York
2021 - 2024