For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Kutlwano Maboe

Kutlwano Maboe

Data Scientist - Data Capture and Management

SOUTH_AFRICA flag
Johannesburg, South Africa
$50.00/hrIntermediateLabelboxLabel StudioCVAT

Key Skills

Software

LabelboxLabelbox
Label StudioLabel Studio
CVATCVAT
Data Annotation TechData Annotation Tech
DataloopDataloop
Google Cloud Vertex AIGoogle Cloud Vertex AI

Top Subject Matter

General IT/Data Science

Top Data Types

TextText
Computer Code ProgrammingComputer Code Programming
ImageImage

Top Task Types

Data Collection
RLHF
Text Generation
Classification
Question Answering
Computer Programming Coding
Transcription
Text Summarization

Freelancer Overview

Data Scientist - Data Capture and Management. Professional background includes roles such as Data Scientist / Office Administrator. Core strengths include Microsoft Office. Education includes Grade 10 Certificate, N/A (2013). AI-training focus includes data types such as Text and labeling workflows including Data Collection.

IntermediateEnglish

Labeling Experience

Advanced Reasoning Data Annotation & LLM Fine-Tuning

TextRLHF
Project Overview: Developed a high-quality dataset designed to enhance the reasoning capabilities of Large Language Models (LLMs). The project focused on "Reasoning Trace" annotation, where raw problem-solving data was transformed into a structured metacognitive format. Key Contributions: * Strategic Annotation: Formatted complex datasets into specialized chat templates using <think> tags to teach models step-by-step logical deduction. * Data Synthesis: Prepared and labeled training pairs for fine-tuning models like Nemotron and TinyLlama using LoRA (Low-Rank Adaptation) techniques. * Quality Assurance: Ensured high-fidelity data labeling to optimize model performance for reasoning-heavy tasks, specifically focusing on mathematical and logical puzzles. * Technical Implementation: Managed the end-to-end pipeline from raw data ingestion to formatted training sets, ensuring the highest level of accuracy for model convergence.

Project Overview: Developed a high-quality dataset designed to enhance the reasoning capabilities of Large Language Models (LLMs). The project focused on "Reasoning Trace" annotation, where raw problem-solving data was transformed into a structured metacognitive format. Key Contributions: * Strategic Annotation: Formatted complex datasets into specialized chat templates using <think> tags to teach models step-by-step logical deduction. * Data Synthesis: Prepared and labeled training pairs for fine-tuning models like Nemotron and TinyLlama using LoRA (Low-Rank Adaptation) techniques. * Quality Assurance: Ensured high-fidelity data labeling to optimize model performance for reasoning-heavy tasks, specifically focusing on mathematical and logical puzzles. * Technical Implementation: Managed the end-to-end pipeline from raw data ingestion to formatted training sets, ensuring the highest level of accuracy for model convergence.

2026 - Present

Data Scientist - Data Capture and Management

TextData Collection
I worked as a Data Scientist focused on capturing, inputting, and managing data relevant to various projects. My work involved supporting operational reporting needs and ensuring the accuracy and integrity of datasets. I assisted with data collection and maintenance, helping uphold the quality of information for analysis and reporting purposes. • Accurately captured and input large volumes of data. • Performed routine data integrity checks and error correction. • Helped organize, validate, and maintain electronic and physical records. • Supported project-based data collection efforts for ongoing analysis.

I worked as a Data Scientist focused on capturing, inputting, and managing data relevant to various projects. My work involved supporting operational reporting needs and ensuring the accuracy and integrity of datasets. I assisted with data collection and maintenance, helping uphold the quality of information for analysis and reporting purposes. • Accurately captured and input large volumes of data. • Performed routine data integrity checks and error correction. • Helped organize, validate, and maintain electronic and physical records. • Supported project-based data collection efforts for ongoing analysis.

Present

Education

N

N/A

Grade 10 Certificate, General Education

Grade 10 Certificate
2013 - 2013

Work History

K

Kaggle platform Online

Data Scientist / Programmer

johannesburg
2023 - Present
N

N/A

Data Scientist / Office Administrator

Johannesburg
Not specified