For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Joseph Mukubu kapoya

Joseph Mukubu kapoya

AI Data Labeler | Text, Code & General Tasks

TUNISIA flag
Sousse, Tunisia
$25.00/hrIntermediateLabel StudioAws SagemakerAnno Mage

Key Skills

Software

Label StudioLabel Studio
AWS SageMakerAWS SageMaker
Anno-MageAnno-Mage
AppenAppen
ArgillaArgilla
Axiom AI
ClickworkerClickworker

Top Subject Matter

Text Annotation
Code Review
LLM Tasks

Top Data Types

DocumentDocument
TextText
ImageImage

Top Task Types

Segmentation
Classification
Prompt Response Writing SFT
Point Key Point
Polyline
Text Summarization
Transcription
Evaluation Rating
Computer Programming Coding
Function Calling
Data Collection
Question Answering
Object Detection
Cuboid
Entity Ner Classification
Bounding Box
Polygon
Text Generation
RLHF
Fine Tuning
Red Teaming

Freelancer Overview

Data Annotator – Cancer Analysis Data Preparation. Brings 1+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Label Studio, Internal, and Proprietary Tooling. Education includes Bachelor of Engineering, École Polytechnique de Sousse (2026). AI-training focus includes data types such as Medical, DICOM, and Document and labeling workflows including Segmentation, Classification, and Prompt + Response Writing (SFT).

IntermediateFrenchEnglishSpanish

Labeling Experience

Label Studio

Data Annotator – Cancer Analysis Data Preparation

Label StudioSegmentation
I participated in the cleaning and harmonization of multi-source datasets for cancer analysis projects. My work focused on meticulous labeling and verification of medical data to ensure high-quality predictive modeling. I assisted in the classification and annotation of diagnostic imaging using specialized annotation tools. • Labeled and harmonized data from clinical, survey, and medical imaging sources. • Ensured consistency in TNM labeling and cancer staging for accuracy in machine learning models. • Contributed to the annotation and classification of Wisconsin diagnostic images. • Utilized Label Studio, LabelImg, and CVAT for segmentation and quality assurance.

I participated in the cleaning and harmonization of multi-source datasets for cancer analysis projects. My work focused on meticulous labeling and verification of medical data to ensure high-quality predictive modeling. I assisted in the classification and annotation of diagnostic imaging using specialized annotation tools. • Labeled and harmonized data from clinical, survey, and medical imaging sources. • Ensured consistency in TNM labeling and cancer staging for accuracy in machine learning models. • Contributed to the annotation and classification of Wisconsin diagnostic images. • Utilized Label Studio, LabelImg, and CVAT for segmentation and quality assurance.

2025 - 2026
Label Studio

NLP Data Annotator – Generative AI & Chatbot Projects

Label StudioTextPrompt Response Writing SFT
I prepared conversational data and instruction-response pairs for language model fine-tuning projects. My work included annotating conversation logs and structuring high-quality datasets for training AI chatbots. Attention to contextual accuracy and relevance was prioritized in each annotation task. • Generated and labeled prompt-response pairs for generative AI chatbot training. • Conducted annotation of conversation data for NLP projects. • Ensured instructions and labels met fine-tuning standards for LLM performance. • Used Label Studio and Excel for data organization and annotation tracking.

I prepared conversational data and instruction-response pairs for language model fine-tuning projects. My work included annotating conversation logs and structuring high-quality datasets for training AI chatbots. Attention to contextual accuracy and relevance was prioritized in each annotation task. • Generated and labeled prompt-response pairs for generative AI chatbot training. • Conducted annotation of conversation data for NLP projects. • Ensured instructions and labels met fine-tuning standards for LLM performance. • Used Label Studio and Excel for data organization and annotation tracking.

2025 - 2025

Data Labeler – Churn Prediction Machine Learning Project

DocumentClassification
I labeled, cleaned, and engineered features from customer activity logs for churn prediction systems. My responsibilities included creating accurate target variables and ensuring data sets were well-prepared for model training. Manual review and correction of noisy data were also performed to optimize AI precision. • Created and labeled churn variables from raw customer activity logs. • Performed extensive manual data cleaning of tabular datasets. • Classified and annotated tabular data for machine learning input. • Used internal data pipeline tools and Excel for annotation and validation.

I labeled, cleaned, and engineered features from customer activity logs for churn prediction systems. My responsibilities included creating accurate target variables and ensuring data sets were well-prepared for model training. Manual review and correction of noisy data were also performed to optimize AI precision. • Created and labeled churn variables from raw customer activity logs. • Performed extensive manual data cleaning of tabular datasets. • Classified and annotated tabular data for machine learning input. • Used internal data pipeline tools and Excel for annotation and validation.

2025 - 2025

Education

É

École Polytechnique de Sousse

Bachelor of Engineering, Data Science and Artificial Intelligence

Bachelor of Engineering
2022 - 2026

Work History

K

Ksarsoft

Web Development Intern

Sousse
2025 - 2025
T

T.U.S

Artificial Intelligence Intern

Sousse
2025 - 2025