For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
H
Hillary Gathumbi

Hillary Gathumbi

Data Validation Specialist (Data Annotation + Extraction)

Kenya flagNairobi, Kenya
$10.00/hrExpertAppenLabelboxLabelimg

Key Skills

Software

AppenAppen
LabelboxLabelbox
LabelImgLabelImg
Scale AIScale AI
Label StudioLabel Studio
ProdigyProdigy
Snorkel AISnorkel AI
CVATCVAT
RoboflowRoboflow
Other

Top Subject Matter

Insurance/Financial Data Extraction
Telematics/IoT & Industrial Visual Data
Industrial Visual Inspection

Top Data Types

ImageImage
TextText
DocumentDocument
VideoVideo

Top Task Types

Object DetectionObject Detection
ClassificationClassification
Entity (NER) ClassificationEntity (NER) Classification
Text GenerationText Generation
TranscriptionTranscription
Data CollectionData Collection
Evaluation/RatingEvaluation/Rating
Prompt + Response Writing (SFT)Prompt + Response Writing (SFT)
Fine-tuningFine-tuning
Question AnsweringQuestion Answering
Text SummarizationText Summarization
Bounding BoxBounding Box
Action RecognitionAction Recognition

Freelancer Overview

Data Validation Specialist (Data Annotation + Extraction). Brings 12+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal, Proprietary Tooling, and CVAT. Education includes Bachelor of Science, Mount Kenya University (2019) and Kenya Certificate of Secondary Education, St Paul’s Kevote Embu High School (2010). AI-training focus includes data types such as Document, Video, and Image and labeling workflows including Entity (NER) Classification, Action Recognition, and Bounding Box.

ExpertEnglishSwahili

Labeling Experience

IT Assistant (Data & Systems Lead)

ImageClassification
Managed and validated structured manufacturing data within ERP systems, emphasizing high-precision labeling for inventory and production images. Verified AI-generated outputs as a human-in-the-loop to maintain compliance and accuracy in image-based visual inspection tasks. Used Python and SAP/NEXX systems for large-scale image and structured data labeling. • Quality assurance for industrial images • High-precision manual review of auto-labeled image outputs • Data validation and error correction in manufacturing datasets • Workflow optimization with human-in-the-loop verification

Managed and validated structured manufacturing data within ERP systems, emphasizing high-precision labeling for inventory and production images. Verified AI-generated outputs as a human-in-the-loop to maintain compliance and accuracy in image-based visual inspection tasks. Used Python and SAP/NEXX systems for large-scale image and structured data labeling. • Quality assurance for industrial images • High-precision manual review of auto-labeled image outputs • Data validation and error correction in manufacturing datasets • Workflow optimization with human-in-the-loop verification

2023 - Present

Consumer Trend and Inventory Pattern Labeling for Luxury Retail

TextData Collection
Led the analysis and tagging of sales and inventory data within a Cloud ERP system (NEXX). I categorized diverse datasets ranging from customer purchasing trends to inventory movement cycles. This "hand-labeling" of complex business data allowed for the creation of accurate management dashboards and informed predictive sales models. My work ensures that the data fed into business intelligence tools is accurately classified, providing a clean foundation for trend analysis and inventory optimization.

Led the analysis and tagging of sales and inventory data within a Cloud ERP system (NEXX). I categorized diverse datasets ranging from customer purchasing trends to inventory movement cycles. This "hand-labeling" of complex business data allowed for the creation of accurate management dashboards and informed predictive sales models. My work ensures that the data fed into business intelligence tools is accurately classified, providing a clean foundation for trend analysis and inventory optimization.

2023 - Present
CVAT

Technical Support Lead (IoT & Visual Data)

CVATVideoClassificationAction Recognition
Supervised review and classification of real-time video telematics and IoT sensor data for fleet management purposes. Identified and tagged safety-critical events in video feeds to provide ground-truth data. Supported development of automated monitoring models through accurate annotation of incidents. • Validated and classified industrial fleet video and IoT data. • Ensured high quality and compliance with annotation standards. • Supported AI model training by curating labeled event data. • Coordinated annotation tasks within technical support teams.

Supervised review and classification of real-time video telematics and IoT sensor data for fleet management purposes. Identified and tagged safety-critical events in video feeds to provide ground-truth data. Supported development of automated monitoring models through accurate annotation of incidents. • Validated and classified industrial fleet video and IoT data. • Ensured high quality and compliance with annotation standards. • Supported AI model training by curating labeled event data. • Coordinated annotation tasks within technical support teams.

2022 - 2022

Multi-Departmental Insurance Claims Data Validation & Categorization

DocumentEntity Ner ClassificationClassification
Performed high-precision classification and verification of claims documentation and policy records. The project involved extracting key entities (names, policy numbers, claim types) and validating them against structured ERP data. By implementing a rigorous multi-step audit process, I successfully reduced data input errors by 30%. This project highlights my ability to handle sensitive financial documents and provide high-quality ground truth data for natural language processing and document AI model

Performed high-precision classification and verification of claims documentation and policy records. The project involved extracting key entities (names, policy numbers, claim types) and validating them against structured ERP data. By implementing a rigorous multi-step audit process, I successfully reduced data input errors by 30%. This project highlights my ability to handle sensitive financial documents and provide high-quality ground truth data for natural language processing and document AI model

2020 - 2021
Roboflow

Industrial Visual Systems Specialist (Freelance Consultant)

RoboflowImageBounding Box
Installed and configured industrial visual inspection systems using CCTV and motion-tracking software. Manually defined detection zones and bounding box parameters for object detection tasks. Reduced false positives in industrial environments through precise annotation protocols. • Set up image-based motion detection and object recognition standards. • Labeled and refined detection parameters for industrial safety. • Implemented customized bounding box annotation for client requirements. • Supported continuous improvement of anomaly detection using annotated images.

Installed and configured industrial visual inspection systems using CCTV and motion-tracking software. Manually defined detection zones and bounding box parameters for object detection tasks. Reduced false positives in industrial environments through precise annotation protocols. • Set up image-based motion detection and object recognition standards. • Labeled and refined detection parameters for industrial safety. • Implemented customized bounding box annotation for client requirements. • Supported continuous improvement of anomaly detection using annotated images.

2015 - 2019

Education

M

MICROSOFT

AI & DATA SCIENCE SPECIALIZATION, Microsoft AI-900 (Azure AI Fundamentals)

AI & DATA SCIENCE SPECIALIZATION
2025 - 2026
M

Mount Kenya University

Bachelor of Science, Business Information Technology

Bachelor of Science
2015 - 2019

Work History

M

Maven Global Limited

IT Assistant (Data & Systems Lead)

Nairobi
2023 - Present
M

Maven Global Limited

IT Assistant

Nairobi
2023 - Present