For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
V
Valentine Mugambi

Valentine Mugambi

AI Engineer (Data Labeling Lead for Vision Datasets)

Kenya flagNairobi, Kenya
$10.00/hrExpertAppenAws SagemakerAxiom AI

Key Skills

Software

AppenAppen
AWS SageMakerAWS SageMaker
Axiom AI
ClickworkerClickworker
CloudFactoryCloudFactory
CrowdFlowerCrowdFlower
Data Annotation TechData Annotation Tech
DataloopDataloop
Deep SystemsDeep Systems
Figure EightFigure Eight
Google Cloud Vertex AIGoogle Cloud Vertex AI
HiveMindHiveMind
HumanaticHumanatic
iMeritiMerit
Img Lab
LabelboxLabelbox
LabelImgLabelImg
Label StudioLabel Studio
MercorMercor
Micro1
Mighty AIMighty AI
MindriftMindrift
OneFormaOneForma
RemotasksRemotasks
SamaSama
RoboflowRoboflow
Scale AIScale AI
Snorkel AISnorkel AI
SuperAnnotateSuperAnnotate
Surge AISurge AI
TolokaToloka
V7 LabsV7 Labs

Top Subject Matter

General AI & Multimodal Systems
Multi-domain (Customer Support, Legal, Medical)
Medical AI & Healthcare

Top Data Types

ImageImage
TextText
DocumentDocument

Top Task Types

SegmentationSegmentation
Fine-tuningFine-tuning
ClassificationClassification
Bounding BoxBounding Box
PolygonPolygon
Entity (NER) ClassificationEntity (NER) Classification
Point/Key PointPoint/Key Point
PolylinePolyline
CuboidCuboid
Object DetectionObject Detection
Text GenerationText Generation
Question AnsweringQuestion Answering
Text SummarizationText Summarization
RLHFRLHF
Red TeamingRed Teaming
TranscriptionTranscription
Evaluation/RatingEvaluation/Rating
Computer Programming/CodingComputer Programming/Coding
Data CollectionData Collection
Function CallingFunction Calling
Prompt + Response Writing (SFT)Prompt + Response Writing (SFT)

Freelancer Overview

AI Engineer (Data Labeling Lead for Vision Datasets). Brings 11+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include Internal and Proprietary Tooling. Education includes Doctor of Philosophy, University of Nairobi (2022) and Master of Science, University of Nairobi (2018). AI-training focus includes data types such as Image and Text and labeling workflows including Segmentation, Fine-tuning, and Classification.

ExpertEnglishSwahili

Labeling Experience

AI Engineer (Data Labeling Lead for Vision Datasets)

ImageSegmentation
Developed and implemented semi-supervised data labeling workflows for large-scale computer vision datasets exceeding 100,000 images. Used automated labeling pipelines followed by human review to maximize annotation quality and efficiency. Achieved a high annotation accuracy rate, while significantly reducing manual labeling costs for the company. • Led the design of labeling strategies combining automation and human-in-the-loop review. • Focused on segmentation and annotation of images for vision models. • Leveraged internal/proprietary tooling for streamlined workflow. • Collaborated with data scientists and engineers to optimize processes.

Developed and implemented semi-supervised data labeling workflows for large-scale computer vision datasets exceeding 100,000 images. Used automated labeling pipelines followed by human review to maximize annotation quality and efficiency. Achieved a high annotation accuracy rate, while significantly reducing manual labeling costs for the company. • Led the design of labeling strategies combining automation and human-in-the-loop review. • Focused on segmentation and annotation of images for vision models. • Leveraged internal/proprietary tooling for streamlined workflow. • Collaborated with data scientists and engineers to optimize processes.

2023 - Present

Machine Learning Specialist (Text Dataset Labeling/Tuning)

TextFine Tuning
Fine-tuned GPT-3.5-turbo models using labeled proprietary datasets including customer support logs, legal contracts, and medical Q&A pairs. Oversaw the curation, preparation, and validation of large-scale text datasets for supervised training tasks. Ensured high quality and diversity in text data to improve model robustness in real-world deployment. • Labeled and reviewed data for chatbot and document-oriented model fine-tuning. • Applied internal/proprietary tooling for large text dataset preparation. • Worked across different subject matter domains: customer service, legal, medical. • Estimated data volume in tens of thousands per dataset for fine-tuning.

Fine-tuned GPT-3.5-turbo models using labeled proprietary datasets including customer support logs, legal contracts, and medical Q&A pairs. Oversaw the curation, preparation, and validation of large-scale text datasets for supervised training tasks. Ensured high quality and diversity in text data to improve model robustness in real-world deployment. • Labeled and reviewed data for chatbot and document-oriented model fine-tuning. • Applied internal/proprietary tooling for large text dataset preparation. • Worked across different subject matter domains: customer service, legal, medical. • Estimated data volume in tens of thousands per dataset for fine-tuning.

2022 - 2023

Junior Software Engineer (Image Annotation Assistant)

ImageClassification
Assisted in labeling and augmenting image datasets for medical imaging research focusing on chest X-ray classification. Supported doctoral candidates with annotation tasks to improve data quality for deep learning experiments. Facilitated the expansion of the labeled dataset for enhanced model training and evaluation. • Used standard annotation protocols for medical imaging. • Tasks included identifying and classifying abnormal versus normal X-rays. • Augmented datasets with standard transforms to increase variation. • Work performed primarily with internal tools and guidance from supervisors.

Assisted in labeling and augmenting image datasets for medical imaging research focusing on chest X-ray classification. Supported doctoral candidates with annotation tasks to improve data quality for deep learning experiments. Facilitated the expansion of the labeled dataset for enhanced model training and evaluation. • Used standard annotation protocols for medical imaging. • Tasks included identifying and classifying abnormal versus normal X-rays. • Augmented datasets with standard transforms to increase variation. • Work performed primarily with internal tools and guidance from supervisors.

2016 - 2017

Education

U

University of Nairobi

Master of Science, Computer Engineering

Master of Science
2018 - 2018
K

Kenyatta University

Bachelor of Science, Computer Science

Bachelor of Science
2012 - 2012

Work History

D

DeepMind

AI Engineer

Nairobi
2023 - Present
O

OpenAI

Machine Learning Specialist

Nairobi
2022 - 2023