For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
C

Chudi Nwachukwu

AI Training / Annotation Specialist – RLHF Tasks (Scale Labs)

UNITED_KINGDOM flag
London, United Kingdom
$30.00/hrEntry LevelScale AI

Key Skills

Software

Scale AIScale AI

Top Subject Matter

Large Language Models/NLP
Legal Services & Contract Review
Regulatory Compliance & Risk Analysis

Top Data Types

TextText
DocumentDocument

Top Task Types

RLHF

Freelancer Overview

AI Training / Annotation Specialist – RLHF Tasks (Scale Labs). Brings 4+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Scale AI. Education includes Bachelor of Science. AI-training focus includes data types such as Text and labeling workflows including RLHF.

Entry LevelIgboEnglish

Labeling Experience

Scale AI

AI Training / Annotation Specialist – RLHF Tasks (Scale Labs)

Scale AITextRLHF
Led and contributed to Reinforcement Learning from Human Feedback (RLHF) tasks for training large language models. Developed and enforced detailed annotation guidelines and scoring rubrics to enhance evaluator consistency. Conducted hands-on annotation and evaluation including prompt evaluation, response ranking, safety review, and preference modeling. Improved training outcomes by tracking quality signals and refining guidelines. • Evaluated and ranked model outputs for prompt response quality. • Performed safety and policy-aligned judgement on ambiguous or edge-case outputs. • Created guidelines to minimize subjective annotation variance. • Analyzed error patterns and task rework to iteratively improve training results.

Led and contributed to Reinforcement Learning from Human Feedback (RLHF) tasks for training large language models. Developed and enforced detailed annotation guidelines and scoring rubrics to enhance evaluator consistency. Conducted hands-on annotation and evaluation including prompt evaluation, response ranking, safety review, and preference modeling. Improved training outcomes by tracking quality signals and refining guidelines. • Evaluated and ranked model outputs for prompt response quality. • Performed safety and policy-aligned judgement on ambiguous or edge-case outputs. • Created guidelines to minimize subjective annotation variance. • Analyzed error patterns and task rework to iteratively improve training results.

2025 - Present

Education

S

School not specified

Bachelor of Science, Business Management

Bachelor of Science
Not specified

Work History

S

Sinot Tech Solutions

Business Analyst

London
2025 - Present
G

GXO Logistics

Business Analyst / Warehouse Operative

London
2024 - 2025