Esther Owubokiri

AI Trainer - Model Optimization & RLHF

N/A, Nigeria
$10.00/hr
Expert
Other
Appen
Clickworker

Key Skills

Software

Other
Appen
Clickworker
Toloka

Top Subject Matter

Large Language Models
Prompt Engineering
RLHF Domain Expertise

Top Data Types

Text
Image
Video

Top Task Types

RLHF
Object Detection
Question Answering
Text Summarization
Fine-tuning
Transcription
Evaluation/Rating
Prompt + Response Writing (SFT)

Freelancer Overview

AI Trainer specializing in model optimization and RLHF, with 1+ years of professional experience across complex professional workflows, research, and quality-focused execution. Education includes a Professional Certificate in Cybersecurity (2026). AI-training focus includes text data and RLHF labeling workflows.

Expert
English
French

Labeling Experience

AI Trainer - Model Optimization & RLHF

Other, Text, RLHF
As an AI Trainer at Outlier AI, I conducted advanced RLHF tasks to refine the accuracy, reasoning, and safety of frontier LLMs. I developed challenging prompts across STEM, Humanities, and Logic domains to evaluate and improve model performance. My work involved rank-ordering LLM outputs using strict rubrics and composing detailed rationales for model decisions.
• Executed high-fidelity evaluation of model outputs, emphasizing truthfulness and helpfulness.
• Fact-checked model-generated claims, addressing hallucinations and inconsistencies.
• Authored technical documentation supporting rationale-based learning.
• Maintained consistent quality while adapting to evolving project guidelines.

2026 - Present

Data Annotator / Moderator

Other, Text, RLHF
At Crowdgen, I executed large-scale RLHF and data annotation for LLM safety and accuracy. My responsibilities included moderating content, applying complex policy guidelines, and collaborating on AI developer projects. I performed entity extraction and search relevance evaluation for AI applications.
• Maintained a 98%+ accuracy rate in content moderation.
• Supported search relevance and entity extraction initiatives for AI.
• Applied rigorous standards for policy compliance and content safety.
• Contributed to AI training datasets for global developers.

2023 - 2025

Education

N/A

Professional Certificate, Cybersecurity

Professional Certificate
2026

Work History

Livingston Research

Academic Research Writer

N/A
2022 - 2022