Zaid Rajput - AI Trainer

Key Skills

Software

Other

Micro1

Mindrift

Scale AI

Top Subject Matter

Large Language Models

Agents Domain Expertise

AI Training Data

Top Data Types

Text

Audio

Document

Top Task Types

Fine Tuning

Text Generation

RLHF

Computer Programming Coding

Transcription

Text Summarization

Object Detection

Freelancer Overview

I have hands-on experience in AI training data workflows, including Reinforcement Learning (RL), Reinforcement Learning from Human Feedback (RLHF), and Supervised Fine-Tuning (SFT). I’ve annotated and curated high-quality datasets for LLMs, covering tasks such as response ranking, preference comparison, instruction tuning, and evaluating model outputs for accuracy, safety, and coherence. As an annotator, I provide structured feedback, label edge cases, and ensure consistency across datasets, which directly improves model alignment and performance. Beyond labeling, I understand the full training pipeline: how SFT builds baseline model behavior and how RLHF refines it through human feedback loops. I’ve also contributed to prompt design, dataset cleaning, and iterative evaluation processes, ensuring data quality at scale. With a strong background in Python, AI tools, and real-world AI system development, I bring both technical depth and practical insight into how high-quality annotations translate into better-performing AI models.

English (Expert)

Labeling Experience

AI Trainer

Computer Code Programming, Prompt Response Writing SFT

As an AI Trainer at Revelo, I create datasets and train large language models (LLMs) and agents for various tasks. I perform supervised fine-tuning (SFT), human feedback integration (HFI), reinforcement learning (RL), and data annotation work. My focus is on improving model performance through the preparation and labeling of high-quality text-based data.

• Create and curate datasets specific to AI training requirements.
• Conduct SFT and fine-tuning for LLMs and AI agents.
• Annotate and label conversational and task-oriented data.
• Perform RL and HFI to enhance model outputs and agent behaviors.

2025 - Present

AI Trainer - Data Specialist

Computer Code Programming, RLHF

Worked on Reinforcement Learning (RL), Reinforcement Learning from Human Feedback (RLHF), and Supervised Fine-Tuning (SFT), annotating and curating high-quality datasets for LLMs.

2024 - Present

Freelance AI Agent Developer

Other, Text, Fine Tuning

As a freelance AI Agent Developer, I have trained and fine-tuned LLM prompts for contextual, business-related chatbot conversations. My work includes developing and optimizing multi-channel chat and voice agents, as well as integrating speech-to-text (STT) and text-to-speech (TTS) components. Data labeling involves contextualizing responses and rating model-generated answers to achieve high operational performance.

• Project-based fine-tuning of LLM prompts for chatbots and voice assistants.
• Label and annotate data for dialogue management and agent training.
• Provide feedback and ratings for AI model responses in real business scenarios.
• Integrate leading TTS/STT platforms in agent workflows and analyze labeled logs.

2021 - Present

AI Trainer and Agent Developer

Video, Prompt Response Writing SFT

As an AI Trainer and Agent Developer at Turing, I was responsible for training LLMs and agents and performing data annotation for AI models. My role included supervised fine-tuning (SFT), human feedback integration (HFI), reinforcement learning from human feedback (RLHF), and evaluation of AI-generated outputs. I also focused on model customization, label creation, and performance optimization through hands-on annotation and rating.

• Fine-tune and adapt LLMs to meet business objectives using SFT and RLHF.
• Annotate and label textual data for use in AI and agent training.
• Evaluate model outputs and provide ratings to guide training direction.
• Integrate models with databases and APIs, optimizing results for production.

2025 - 2026

Education

FAST NUCES Islamabad

Bachelor of Science, Software Engineering

2020 - 2024

Work History

Freelancer

AI Agent Developer

N/A

2021 - Present

iMobile

Full Stack Developer

Remote

2022 - 2025