For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Nahashon Mbugua

Nahashon Mbugua

LLM Fine-tuning & Multilingual Content Moderation

United Kingdom flaggermany, United Kingdom
$18.00/hrExpertCVATLabel StudioAppen

Key Skills

Software

CVATCVAT
Label StudioLabel Studio
AppenAppen
SuperAnnotateSuperAnnotate

Top Subject Matter

No subject matter listed

Top Data Types

Computer Code ProgrammingComputer Code Programming
TextText
VideoVideo

Top Task Types

Computer Programming Coding
Evaluation Rating
Fine Tuning
Prompt Response Writing SFT

Freelancer Overview

I’ve spent over four years working in data labeling and AI training data, helping to create the high-quality datasets that power machine learning models. My experience spans across computer vision, natural language processing (NLP), and audio processing, where I’ve worked on everything from image annotation to text classification and audio transcription. I’ve used a variety of tools, including Labelbox, Scale AI, CVAT, and have contributed to projects in industries like autonomous driving, e-commerce, and social media moderation, always focusing on accuracy and efficiency. I also have a strong background in LLM fine-tuning and multilingual content moderation, especially in English and German. I’ve worked on labeling datasets for tasks like sentiment analysis, text generation, and intent classification, making sure the data is clean and ready for training. My experience also includes evaluating AI models, where I provide feedback on their performance and help improve their accuracy. Overall, I bring a mix of technical skills and a keen eye for detail, making me a reliable contributor to any AI or data labeling project.

ExpertFrenchGermanEnglish

Labeling Experience

Appen

Audio Transcription & Sentiment Analysis for Customer Service

AppenAudioClassificationEmotion Recognition
Transcribed and labeled customer service call recordings for sentiment and emotion recognition. Analyzed interactions to detect service issues and improve speech-based AI support agents.

Transcribed and labeled customer service call recordings for sentiment and emotion recognition. Analyzed interactions to detect service issues and improve speech-based AI support agents.

2022 - 2024
SuperAnnotate

python Development

SuperannotateComputer Code ProgrammingComputer Programming Coding
I worked on a data labeling project where we used Python to help prepare and organize data for machine learning models. The main goal was to make sure the data was clean, well-labelled, and ready for training AI systems. What I Did: I wrote Python scripts to: Clean and format data before it was labelled. Automatically check for errors or missing labels. Help visualize data and annotations so we could spot mistakes more easily. I also helped upload and download large sets of data using scripts, which saved a lot of time for the team. Project Size: The dataset had over 1,000 files, including both images and text. I worked as part of a team, and we shared tools and gave each other feedback to make sure everything stayed on track. Some of the scripts I wrote were used daily by other annotators and engineers. Quality and Accuracy

I worked on a data labeling project where we used Python to help prepare and organize data for machine learning models. The main goal was to make sure the data was clean, well-labelled, and ready for training AI systems. What I Did: I wrote Python scripts to: Clean and format data before it was labelled. Automatically check for errors or missing labels. Help visualize data and annotations so we could spot mistakes more easily. I also helped upload and download large sets of data using scripts, which saved a lot of time for the team. Project Size: The dataset had over 1,000 files, including both images and text. I worked as part of a team, and we shared tools and gave each other feedback to make sure everything stayed on track. Some of the scripts I wrote were used daily by other annotators and engineers. Quality and Accuracy

2021 - 2024
SuperAnnotate

Vehicle Behavioural Video Annotation

SuperannotateVideoAction RecognitionTracking
Training autonomous vehicle perception systems Traffic behavior analysis and prediction

Training autonomous vehicle perception systems Traffic behavior analysis and prediction

2021 - 2024
Appen

LLM Fine-tuning & Multilingual Content Moderation

AppenTextClassificationEmotion Recognition
Worked on fine-tuning large language models (LLMs) by generating and evaluating prompt–response pairs in English and Spanish. Labeled harmful or biased content, evaluated AI responses for quality, and performed redteaming tasks to identify safety risks. Contributed to multilingual moderation datasets by classifying toxicity, hate speech, and inappropriate language across various cultural contexts.

Worked on fine-tuning large language models (LLMs) by generating and evaluating prompt–response pairs in English and Spanish. Labeled harmful or biased content, evaluated AI responses for quality, and performed redteaming tasks to identify safety risks. Contributed to multilingual moderation datasets by classifying toxicity, hate speech, and inappropriate language across various cultural contexts.

2020 - 2024

Education

U

University of Birmingham

Bachelor of Science (BSc) in Biomedical Sciences, Biomedical Science

Bachelor of Science (BSc) in Biomedical Sciences
2016 - 2019

Work History

No Work History added yet

Nahashon M. hasn’t added any Work History to their OpenTrain profile yet.