For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
C

Chinemerem Kalu

AI Content Researcher/Generalist – RLHF (Reinforcement Learning from Human Feedback)

United Kingdom flagPlymouth, United Kingdom
$15.00/hrEntry LevelOther

Key Skills

Software

Other

Top Subject Matter

Large Language Models (LLMs) and AI Content Research
Legal Services & Contract Review
Regulatory Compliance & Risk Analysis

Top Data Types

TextText
DocumentDocument

Top Task Types

RLHF

Freelancer Overview

AI Content Researcher/Generalist – RLHF (Reinforcement Learning from Human Feedback). Brings 7+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include N and A. Education includes Bachelor of Science, N/A (2024). AI-training focus includes data types such as Text and labeling workflows including RLHF.

Entry LevelEnglish

Labeling Experience

AI Content Researcher/Generalist – RLHF (Reinforcement Learning from Human Feedback)

TextRLHF
Contributed to AI training projects by evaluating and ranking model responses using Reinforcement Learning from Human Feedback. Ensured model accuracy and safety by identifying logical fallacies and factual inaccuracies in textual outputs. Applied prompt engineering and analytical writing skills to maximize LLM performance and relevance. • Evaluated and rated large language model responses. • Identified hallucinations and inaccuracies in output text. • Applied strict guidelines and rubrics for feedback accuracy. • Utilized technical reporting and fact-checking expertise in RLHF tasks.

Contributed to AI training projects by evaluating and ranking model responses using Reinforcement Learning from Human Feedback. Ensured model accuracy and safety by identifying logical fallacies and factual inaccuracies in textual outputs. Applied prompt engineering and analytical writing skills to maximize LLM performance and relevance. • Evaluated and rated large language model responses. • Identified hallucinations and inaccuracies in output text. • Applied strict guidelines and rubrics for feedback accuracy. • Utilized technical reporting and fact-checking expertise in RLHF tasks.

Not specified

Education

U

university of plymouth

bsc, plymouth

bsc
2024 - 2024

Work History

A

Arlington Hotel and Suites

Supervisor & Accountant

Plymouth
2022 - 2024
N

NESREA

IT & Environmental Analyst

Abuja
2021 - 2022