For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
D

David Wachira

LLM Evaluator / AI Trainer — Outlier AI (Remote)

KENYA flag
Nyeri, Kenya
$30.00/hrIntermediateOther

Key Skills

Software

Other

Top Subject Matter

Large Language Models
General Knowledge
Reasoning Domain Expertise

Top Data Types

TextText
DocumentDocument

Top Task Types

RLHF
Entity Ner Classification
Classification

Freelancer Overview

LLM Evaluator / AI Trainer — Outlier AI (Remote). Brings 4+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Other. Education includes Bachelor of Science, Moi University (2024). AI-training focus includes data types such as Text and labeling workflows including RLHF, Entity (NER) Classification, and Classification.

IntermediateEnglishSpanish

Labeling Experience

Data Annotator — Handshake (Remote)

OtherTextClassification
At Handshake, I delivered high-accuracy text annotations for classification and question answering workflows. I reviewed peer annotations to maintain dataset quality and contributed to resolving ambiguous labeling scenarios. My responsibilities included upholding standards and ensuring timely completion of annotation tasks. • Labeled text for classification and QA workflows. • Maintained high dataset quality through peer reviews. • Played a role in guideline clarification and ambiguity resolution. • Ensured reliable throughput with accuracy retention.

At Handshake, I delivered high-accuracy text annotations for classification and question answering workflows. I reviewed peer annotations to maintain dataset quality and contributed to resolving ambiguous labeling scenarios. My responsibilities included upholding standards and ensuring timely completion of annotation tasks. • Labeled text for classification and QA workflows. • Maintained high dataset quality through peer reviews. • Played a role in guideline clarification and ambiguity resolution. • Ensured reliable throughput with accuracy retention.

2023 - Present

AI Data Annotator — Uber AI (Remote)

OtherTextEntity Ner Classification
At Uber AI, I labeled text datasets for NLP tasks such as named entity recognition, sentiment, and intent classification. I also conducted quality assurance reviews to ensure labeling consistency and accuracy. I contributed tools and flagged edge cases to improve annotation guidelines and workflow efficiency. • Executed high-precision annotations for NLP tasks. • Performed QA reviews and ensured robust labeling standards. • Developed tools to enhance annotation workflows. • Collaborated to resolve ambiguous data points.

At Uber AI, I labeled text datasets for NLP tasks such as named entity recognition, sentiment, and intent classification. I also conducted quality assurance reviews to ensure labeling consistency and accuracy. I contributed tools and flagged edge cases to improve annotation guidelines and workflow efficiency. • Executed high-precision annotations for NLP tasks. • Performed QA reviews and ensured robust labeling standards. • Developed tools to enhance annotation workflows. • Collaborated to resolve ambiguous data points.

2023 - Present

LLM Evaluator / AI Trainer — Outlier AI (Remote)

OtherTextRLHF
As an LLM Evaluator and AI Trainer at Outlier AI, I evaluated and ranked large language model outputs using RLHF frameworks. My work focused on identifying hallucinations, reasoning errors, and instruction-following failures within text responses. I wrote structured rationales for evaluation decisions while ensuring consistent, high-quality scores across tasks. • Ranked and critiqued LLM responses for coherence and accuracy. • Assessed instruction-following, factual correctness, and safety. • Delivered structured rationales improving model outcomes. • Consistently met performance metrics for output quality.

As an LLM Evaluator and AI Trainer at Outlier AI, I evaluated and ranked large language model outputs using RLHF frameworks. My work focused on identifying hallucinations, reasoning errors, and instruction-following failures within text responses. I wrote structured rationales for evaluation decisions while ensuring consistent, high-quality scores across tasks. • Ranked and critiqued LLM responses for coherence and accuracy. • Assessed instruction-following, factual correctness, and safety. • Delivered structured rationales improving model outcomes. • Consistently met performance metrics for output quality.

2023 - Present

Education

M

Moi University

Bachelor of Science, Financial Economics

Bachelor of Science
2020 - 2024

Work History

H

Handshake (Remote)

Data Annotator

Location not specified
2023 - Present
U

Uber AI (Remote)

AI Data Annotator

Location not specified
2023 - Present