For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Jesse Ngugi

Jesse Ngugi

AI Trainer and Data Specialist - Machine Learning

Kenya flagNairobi, Kenya
$50.00/hrIntermediateMercorLabelboxCVAT

Key Skills

Software

MercorMercor
LabelboxLabelbox
CVATCVAT

Top Subject Matter

Electrical and electronics
Big data analysis
Machine learning.

Top Data Types

TextText
DocumentDocument
VideoVideo
ImageImage

Top Task Types

Entity Ner Classification
Text Generation
Prompt Response Writing SFT
Fine Tuning
Audio Recording
Transcription
Action Recognition
Mapping
Data Collection
Evaluation Rating

Freelancer Overview

I am an experienced AI data specialist with over six years supporting machine learning and AI systems through high-quality data labeling, annotation, and workflow validation. My background spans large-scale projects in natural language processing, content evaluation, and automation logic, where I have consistently delivered accurate, reliable datasets for foundation model training and evaluation. I excel at designing and executing structured annotation workflows, performing error analysis, and maintaining detailed technical documentation. I am highly adaptable to new tools and thrive in remote, asynchronous environments, always ensuring quality, consistency, and timely delivery across diverse AI data initiatives.

IntermediateFrenchGermanEnglish

Labeling Experience

CVAT

AI Data Contributor

CVATImageFine TuningEvaluation Rating
I participated in early-stage AI data programs focused on content evaluation, data validation, and quality assurance to support the development of reliable machine learning datasets. My role involved executing rule-based decision workflows that required precise judgment, attention to detail, and strict adherence to consistency standards. I systematically logged errors, anomalies, and improvement suggestions to help refine datasets and optimize downstream model performance. Working in a fully remote environment, I developed strong self-management skills, maintaining productivity, accuracy, and discipline while handling complex technical tasks independently.

I participated in early-stage AI data programs focused on content evaluation, data validation, and quality assurance to support the development of reliable machine learning datasets. My role involved executing rule-based decision workflows that required precise judgment, attention to detail, and strict adherence to consistency standards. I systematically logged errors, anomalies, and improvement suggestions to help refine datasets and optimize downstream model performance. Working in a fully remote environment, I developed strong self-management skills, maintaining productivity, accuracy, and discipline while handling complex technical tasks independently.

2024 - 2025
CVAT

AI Training & Evaluation Analyst

CVATVideoEntity Ner ClassificationAction Recognition
During my work on multiple AI training projects, I supported data annotation, linguistic evaluation, and workflow testing to ensure the creation of high-quality datasets for machine learning models. I carefully followed detailed task instructions and style guides to deliver consistent, accurate outputs while balancing high-volume workloads under tight deadlines. I proactively identified logical errors and edge cases within task pipelines, escalating issues with clear technical explanations to facilitate resolution and improve overall workflow reliability. Additionally, I quickly adapted to new tools, interfaces, and evolving project requirements, maintaining productivity and accuracy across diverse AI training initiatives.

During my work on multiple AI training projects, I supported data annotation, linguistic evaluation, and workflow testing to ensure the creation of high-quality datasets for machine learning models. I carefully followed detailed task instructions and style guides to deliver consistent, accurate outputs while balancing high-volume workloads under tight deadlines. I proactively identified logical errors and edge cases within task pipelines, escalating issues with clear technical explanations to facilitate resolution and improve overall workflow reliability. Additionally, I quickly adapted to new tools, interfaces, and evolving project requirements, maintaining productivity and accuracy across diverse AI training initiatives.

2024 - 2025
Labelbox

Senior AI Data Specialist

LabelboxDocumentFine TuningPrompt Response Writing SFT
As a Senior AI Data Specialist at Scale AI, I contributed to the development of high-quality supervised fine-tuning and evaluation datasets across a range of text-based AI tasks. My responsibilities centered on producing reliable ground-truth data to support model training, validation, and performance benchmarking for advanced language systems. I designed and implemented structured labeling workflows that ensured clarity, consistency, and scalability across annotation cycles. A core component of my role involved performing cross-validation and secondary reviews to verify dataset integrity and reduce variance in labeling outputs. I analyzed edge cases, resolved inconsistencies, and applied systematic quality-control measures to strengthen overall dataset robustness. Additionally, I reviewed automation-style task flows and logical sequences to ensure procedural correctness, coherent reasoning structures, and compliance with project specifications.

As a Senior AI Data Specialist at Scale AI, I contributed to the development of high-quality supervised fine-tuning and evaluation datasets across a range of text-based AI tasks. My responsibilities centered on producing reliable ground-truth data to support model training, validation, and performance benchmarking for advanced language systems. I designed and implemented structured labeling workflows that ensured clarity, consistency, and scalability across annotation cycles. A core component of my role involved performing cross-validation and secondary reviews to verify dataset integrity and reduce variance in labeling outputs. I analyzed edge cases, resolved inconsistencies, and applied systematic quality-control measures to strengthen overall dataset robustness. Additionally, I reviewed automation-style task flows and logical sequences to ensure procedural correctness, coherent reasoning structures, and compliance with project specifications.

2023 - 2024
Mercor

AI Language & Data Trainer

MercorTextEntity Ner ClassificationText Generation
From May 2024 to March 2025, I worked as an AI Language & Data Trainer with Mercor, contributing to the development and evaluation of large-scale English language datasets used in training foundation models. My role focused on annotating, validating, and refining high-volume textual data to improve model accuracy, contextual understanding, and response reliability. I applied strict quality-control frameworks to ensure consistency, linguistic precision, and adherence to continuously evolving annotation guidelines. This required careful interpretation of task instructions, calibration against benchmark examples, and proactive identification of ambiguities within dataset specifications.

From May 2024 to March 2025, I worked as an AI Language & Data Trainer with Mercor, contributing to the development and evaluation of large-scale English language datasets used in training foundation models. My role focused on annotating, validating, and refining high-volume textual data to improve model accuracy, contextual understanding, and response reliability. I applied strict quality-control frameworks to ensure consistency, linguistic precision, and adherence to continuously evolving annotation guidelines. This required careful interpretation of task instructions, calibration against benchmark examples, and proactive identification of ambiguities within dataset specifications.

2023 - 2024

Education

N

N/A

Bachelor of Engineering, Electrical and Electronics Engineering

Bachelor of Engineering
2012 - 2016

Work History

A

Appen / Figure Eight

AI Training & Evaluation Analyst

New South Wales
2024 - 2025
M

Mercor (AI Training Platform)

AI Language & Data Trainer

California
2024 - 2025