For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
S

Stephen

AI Data Specialist

KENYA flag
Kutus, Kenya
$20.00/hrExpertSamaCloudfactory

Key Skills

Software

SamaSama
CloudFactoryCloudFactory

Top Subject Matter

Computer Vision
Nlp Domain Expertise
LLM Chatbots

Top Data Types

ImageImage
TextText

Top Task Types

Segmentation
Prompt Response Writing SFT
Classification

Freelancer Overview

AI Data Specialist. Brings 2+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Sama, Internal, and Proprietary Tooling. Education includes Bachelor of Science, N/A (2021) and Bachelor of Arts, University of Nairobi (2020). AI-training focus includes data types such as Image, Computer Code, and Programming and labeling workflows including Segmentation, Prompt + Response Writing (SFT), and Classification.

ExpertEnglish

Labeling Experience

Freelance AI Data Strategist & Developer

Prompt Response Writing SFT
Developed custom pipelines for extracting code and generating instruction-response pairs for LLM fine-tuning. Engineered quality-filtering scripts and leveraged automation tools for high-volume data compilation. Ensured dataset readiness for specialized chatbot models. • Built data pipelines with OpenRouter for chatbot training data. • Extracted code from GitHub using automated scrapers. • Generated more than 5,000 high-quality instruction pairs. • Implemented deduplication and scoring scripts for QA.

Developed custom pipelines for extracting code and generating instruction-response pairs for LLM fine-tuning. Engineered quality-filtering scripts and leveraged automation tools for high-volume data compilation. Ensured dataset readiness for specialized chatbot models. • Built data pipelines with OpenRouter for chatbot training data. • Extracted code from GitHub using automated scrapers. • Generated more than 5,000 high-quality instruction pairs. • Implemented deduplication and scoring scripts for QA.

2025 - Present
Sama

AI Data Specialist

SamaImageSegmentation
Performed high-complexity data annotation tasks including semantic segmentation and technical labeling for computer vision and NLP models. Maintained accuracy ratings on enterprise-level datasets and collaborated on refining labeling guidelines. Ensured quality and consistency under tight project timelines. • Executed image and text annotation tasks for AI model training. • Specialized in technical domains within computer vision projects. • Achieved and maintained 98%+ accuracy on 'Gold Standard' benchmarks. • Worked with project managers to improve data labeling processes.

Performed high-complexity data annotation tasks including semantic segmentation and technical labeling for computer vision and NLP models. Maintained accuracy ratings on enterprise-level datasets and collaborated on refining labeling guidelines. Ensured quality and consistency under tight project timelines. • Executed image and text annotation tasks for AI model training. • Specialized in technical domains within computer vision projects. • Achieved and maintained 98%+ accuracy on 'Gold Standard' benchmarks. • Worked with project managers to improve data labeling processes.

2024 - Present
CloudFactory

AI Trainer

CloudfactoryTextClassification
Contributed to large-scale data labeling projects with a focus on quality and throughput. Employed established guidelines and reviewed peer annotations for accuracy. Supported integrity checks across high-volume text datasets. • Handled classification and labeling tasks for NLP datasets. • Participated in quality control and integrity processes. • Utilized CloudFactory labeling infrastructure and procedures. • Maintained data consistency and project timelines.

Contributed to large-scale data labeling projects with a focus on quality and throughput. Employed established guidelines and reviewed peer annotations for accuracy. Supported integrity checks across high-volume text datasets. • Handled classification and labeling tasks for NLP datasets. • Participated in quality control and integrity processes. • Utilized CloudFactory labeling infrastructure and procedures. • Maintained data consistency and project timelines.

2023 - 2023

Education

U

University of Nairobi

Bachelor of Arts, English and Linguistics

Bachelor of Arts
2016 - 2020
N

N/A

Bachelor of Science, Computer Science

Bachelor of Science
2021

Work History

F

Freelance

AI Data Strategist and Developer

Kutus
2025 - Present