Francis Kamau

AI Data Training & Annotation Specialist | Text, Image & Audio

Nairobi, Kenya
$15.00/hr | Expert | AWS SageMaker | Appen | Clickworker

Key Skills

Software

AWS SageMaker
Appen
Clickworker
CloudFactory
Deep Systems
Mercor
Mindrift
OneForma
Remotasks
Scale AI
Surge AI
Telus
Labelbox
Other

Top Subject Matter

No subject matter listed

Top Data Types

Audio
Image
Video

Top Task Types

Computer Programming Coding
Data Collection
Entity NER Classification
Evaluation Rating
Translation Localization

Freelancer Overview

I am a data analyst and AI practitioner with over 7 years of experience in data analytics, machine learning, and AI model development. I have built and deployed credit scoring and cash flow prediction models using Python, TensorFlow, and AWS, and I have managed international data teams. My technical skills include Python, R, SQL, Tableau, and data visualization, along with extensive experience in data cleaning, labeling, standardization, and model evaluation. I have also led Natural Language Processing (NLP) projects covering sentiment analysis, language detection, and named entity recognition using spaCy and Hugging Face, and I have built datasets for translation and sentiment modeling that provide high-quality, ethically sound AI training data. This combination of technical expertise, leadership experience, and hands-on project delivery equips me to help build reliable, impactful AI systems.

Expert | Swahili | English | Spanish

Labeling Experience

Telus

Quality Rating and Evaluation for AI Models

Telus | Video | Emotion Recognition | Evaluation Rating
Performed quality rating and evaluation tasks for AI and machine learning models. Responsibilities included assessing outputs from text, audio, and image models for accuracy, relevance, and adherence to guidelines. Provided detailed feedback to improve model performance and reduce biases. Maintained consistency and high quality through cross-checking with team members and following strict evaluation standards.

2025

Scale AI

Data Annotation and Labeling for AI Training

Scale AI | Image | Bounding Box | Entity NER Classification
Performed data annotation and labeling across multiple modalities to support machine learning model training. Tasks included annotating text for named entities and sentiment, labeling images with bounding boxes and segmentation masks, and transcribing audio with timestamps. Ensured high-quality data by following detailed labeling guidelines, performing inter-annotator agreement checks, and conducting quality audits.

2024 - 2025

Data Annotation and Coding for AI Pipelines

Other | Computer Code Programming | Computer Programming Coding | Prompt Response Writing SFT
In this project, I performed coding and data programming tasks to support AI model training and data annotation pipelines. Developed scripts to automate data preprocessing, labeling, and validation workflows for text, image, and audio datasets. Implemented function calls, prompt-response generation, and quality checks to ensure accurate, high-quality labeled data.

2023 - 2025

Labelbox

Natural Language Data Labeling and Evaluation for NLP Model Training

Labelbox | Text | Entity NER Classification | Segmentation
Led a Natural Language Processing (NLP) data labeling initiative to build high-quality multilingual datasets for sentiment analysis, named entity recognition (NER), and translation model development. Managed and annotated text datasets used in training and evaluation of AI models, ensuring data quality, consistency, and balanced representation. Collaborated with a distributed labeling team using Appen and Labelbox, implementing Python scripts for automated validation and error detection. The labeled datasets were later used to fine-tune and evaluate AI language models for translation, sentiment detection, and chatbot response optimization.

2021 - 2024

Education

KCA University

Master of Science in Data Analytics, Data Science

2023 - 2025

Maastricht Economic and Social Research Institute on Innovation and Technology (UNU-MERIT)

PhD Programme in Innovation, Data Science
2019 - 2025

Work History

L-iFT

Data Lead Analyst

Amsterdam
2019 - Present

Aesops

Lead Analyst: Natural Language Modeling Initiatives (Consultant)

Amsterdam
2022 - 2025