For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
K

Kennedy Muriuki

AI Research Associate, Scale AI

Kenya flagRemote, Kenya
$25.00/hrExpertOtherMercorRemotasks

Key Skills

Software

Other
MercorMercor
RemotasksRemotasks
TolokaToloka
AWS SageMakerAWS SageMaker
AppenAppen

Top Subject Matter

Large Language Models
AI model evaluation
Text mining

Top Data Types

TextText
DocumentDocument
ImageImage

Top Task Types

Entity (NER) ClassificationEntity (NER) Classification
RLHFRLHF
Fine-tuningFine-tuning
Computer Programming/CodingComputer Programming/Coding
Data CollectionData Collection
Prompt + Response Writing (SFT)Prompt + Response Writing (SFT)
Question AnsweringQuestion Answering
SegmentationSegmentation
ClassificationClassification
Text SummarizationText Summarization
Text GenerationText Generation

Freelancer Overview

AI Research Associate, Scale AI. Brings 3+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include Scale AI and Other. Education includes Bachelor of Science, Maseno University (2022). AI-training focus includes data types such as Text and labeling workflows including Evaluation, Rating, and Entity (NER) Classification.

ExpertEnglish

Labeling Experience

Scale AI

AI Research Associate, Scale AI

Scale AIText
Evaluated AI model performance and conducted structured error analysis on outputs from various LLMs. Generated actionable feedback to improve prompt design and influence model tuning and output consistency. Used Python pipelines to automate model evaluation, reducing manual validation and standardizing output comparisons across GPT-4, Claude, and Mistral. • Evaluated outputs using RLHF and custom metrics • Documented errors to improve prompt engineering • Automated model evaluation pipelines for reliability • Provided analysis for model tuning and output consistency

Evaluated AI model performance and conducted structured error analysis on outputs from various LLMs. Generated actionable feedback to improve prompt design and influence model tuning and output consistency. Used Python pipelines to automate model evaluation, reducing manual validation and standardizing output comparisons across GPT-4, Claude, and Mistral. • Evaluated outputs using RLHF and custom metrics • Documented errors to improve prompt engineering • Automated model evaluation pipelines for reliability • Provided analysis for model tuning and output consistency

2023 - 2025

Undergraduate Researcher – Machine Learning, Maseno University

OtherTextEntity Ner Classification
Labeled and annotated datasets for sentiment and entity extraction experiments using Python pipelines. Managed data scraping, preprocessing, and labeling processes to achieve high-accuracy AI model training datasets. Automated data collection to significantly reduce time required for dataset preparation. • Conducted sentiment and entity recognition labeling • Developed pipelines for efficient labeling and preprocessing • Ensured quality and consistency in AI training data • Achieved 95% accuracy in sentiment/entity extraction tasks

Labeled and annotated datasets for sentiment and entity extraction experiments using Python pipelines. Managed data scraping, preprocessing, and labeling processes to achieve high-accuracy AI model training datasets. Automated data collection to significantly reduce time required for dataset preparation. • Conducted sentiment and entity recognition labeling • Developed pipelines for efficient labeling and preprocessing • Ensured quality and consistency in AI training data • Achieved 95% accuracy in sentiment/entity extraction tasks

2021 - 2022

Education

M

Maseno University

Bachelor of Science, Information Technology

Bachelor of Science
2017 - 2022

Work History

B

Benchly

Software Engineer Intern

Remote
2023 - 2023
M

Maseno University

Undergraduate Researcher – Machine Learning

Maseno
2021 - 2022