Jesse Ngugi - AI Trainer and Data Specialist - Machine Learning

Key Skills

Software

Mercor

Labelbox

CVAT

Top Subject Matter

Electrical and electronics

Big data analysis

Machine learning.

Top Data Types

Text

Document

Video

Image

Top Task Types

Entity Ner Classification

Text Generation

Prompt Response Writing SFT

Fine Tuning

Audio Recording

Transcription

Action Recognition

Mapping

Data Collection

Evaluation Rating

Freelancer Overview

I am an experienced AI data specialist with over six years supporting machine learning and AI systems through high-quality data labeling, annotation, and workflow validation. My background spans large-scale projects in natural language processing, content evaluation, and automation logic, where I have consistently delivered accurate, reliable datasets for foundation model training and evaluation. I excel at designing and executing structured annotation workflows, performing error analysis, and maintaining detailed technical documentation. I am highly adaptable to new tools and thrive in remote, asynchronous environments, always ensuring quality, consistency, and timely delivery across diverse AI data initiatives.

IntermediateFrenchGermanEnglish

Labeling Experience

AI Data Contributor

CVATImageFine TuningEvaluation Rating

I participated in early-stage AI data programs focused on content evaluation, data validation, and quality assurance to support the development of reliable machine learning datasets. My role involved executing rule-based decision workflows that required precise judgment, attention to detail, and strict adherence to consistency standards. I systematically logged errors, anomalies, and improvement suggestions to help refine datasets and optimize downstream model performance. Working in a fully remote environment, I developed strong self-management skills, maintaining productivity, accuracy, and discipline while handling complex technical tasks independently.

2024 - 2025

AI Training & Evaluation Analyst

CVATVideoEntity Ner ClassificationAction Recognition

During my work on multiple AI training projects, I supported data annotation, linguistic evaluation, and workflow testing to ensure the creation of high-quality datasets for machine learning models. I carefully followed detailed task instructions and style guides to deliver consistent, accurate outputs while balancing high-volume workloads under tight deadlines. I proactively identified logical errors and edge cases within task pipelines, escalating issues with clear technical explanations to facilitate resolution and improve overall workflow reliability. Additionally, I quickly adapted to new tools, interfaces, and evolving project requirements, maintaining productivity and accuracy across diverse AI training initiatives.

2024 - 2025

Senior AI Data Specialist

LabelboxDocumentFine TuningPrompt Response Writing SFT

As a Senior AI Data Specialist at Scale AI, I contributed to the development of high-quality supervised fine-tuning and evaluation datasets across a range of text-based AI tasks. My responsibilities centered on producing reliable ground-truth data to support model training, validation, and performance benchmarking for advanced language systems. I designed and implemented structured labeling workflows that ensured clarity, consistency, and scalability across annotation cycles. A core component of my role involved performing cross-validation and secondary reviews to verify dataset integrity and reduce variance in labeling outputs. I analyzed edge cases, resolved inconsistencies, and applied systematic quality-control measures to strengthen overall dataset robustness. Additionally, I reviewed automation-style task flows and logical sequences to ensure procedural correctness, coherent reasoning structures, and compliance with project specifications.

2023 - 2024

AI Language & Data Trainer

MercorTextEntity Ner ClassificationText Generation

From May 2024 to March 2025, I worked as an AI Language & Data Trainer with Mercor, contributing to the development and evaluation of large-scale English language datasets used in training foundation models. My role focused on annotating, validating, and refining high-volume textual data to improve model accuracy, contextual understanding, and response reliability. I applied strict quality-control frameworks to ensure consistency, linguistic precision, and adherence to continuously evolving annotation guidelines. This required careful interpretation of task instructions, calibration against benchmark examples, and proactive identification of ambiguities within dataset specifications.

2023 - 2024

Education

N

N/A

Bachelor of Engineering, Electrical and Electronics Engineering

Bachelor of Engineering

2012 - 2016

Work History

A

Appen / Figure Eight

AI Training & Evaluation Analyst

New South Wales

2024 - 2025

M

Mercor (AI Training Platform)

AI Language & Data Trainer

California

2024 - 2025