For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Abdulrahman Ibrahim

Abdulrahman Ibrahim

RLHF and AI Evaluation Annotator

NIGERIA flag
Abuja, Nigeria
$20.00/hrExpertScale AICVATLabelbox

Key Skills

Software

Scale AIScale AI
CVATCVAT
LabelboxLabelbox
Other
Data Annotation TechData Annotation Tech
Google Cloud Vertex AIGoogle Cloud Vertex AI

Top Subject Matter

Stem Domain Expertise
Software logic
Technical writing

Top Data Types

Computer Code ProgrammingComputer Code Programming
TextText
DocumentDocument
ImageImage

Top Task Types

Entity Ner Classification
RLHF
Classification
Question Answering
Computer Programming Coding
Text Generation
Bounding Box

Freelancer Overview

RLHF and AI Evaluation Annotator. Core strengths include uTest and Testlio. AI-training focus includes data types such as Text and labeling workflows including RLHF.

ExpertEnglish

Labeling Experience

CVAT

Image Annotation and Data Labeling Contributor

CVATTextClassificationRLHF
I conducted Reinforcement Learning from Human Feedback (RLHF) to improve language model performance, focusing on STEM and software logic subject matter. Duties involved evaluating responses from AI systems for mathematical and structural accuracy, ranking outputs, and flagging safety or logic errors. Step-by-step reasoning verification and prompt consistency audit were also key parts of my workflow. • Used uTest and Testlio as annotation and evaluation platforms. • Assessed logical flow and correctness for technical and academic prompts. • Specialized in STEM-focused prompt engineering and ranking. • Produced detailed documentation of errors and edge-case handling.

I conducted Reinforcement Learning from Human Feedback (RLHF) to improve language model performance, focusing on STEM and software logic subject matter. Duties involved evaluating responses from AI systems for mathematical and structural accuracy, ranking outputs, and flagging safety or logic errors. Step-by-step reasoning verification and prompt consistency audit were also key parts of my workflow. • Used uTest and Testlio as annotation and evaluation platforms. • Assessed logical flow and correctness for technical and academic prompts. • Specialized in STEM-focused prompt engineering and ranking. • Produced detailed documentation of errors and edge-case handling.

2022 - Present

Technical Subject Matter Expert Structural Engineering/software engineering

TextRLHF
Performed high-level data validation and RLHF for Large Language Models (LLMs) specifically focused on STEM and Engineering accuracy. My tasks involved auditing AI-generated responses for structural mechanics, verifying complex calculations against BS 5950 design codes, and ensuring logical 'Chain of Thought' reasoning in solving beam stress and footing design problems. I maintained a 98% accuracy rating across all technical validation tasks.

Performed high-level data validation and RLHF for Large Language Models (LLMs) specifically focused on STEM and Engineering accuracy. My tasks involved auditing AI-generated responses for structural mechanics, verifying complex calculations against BS 5950 design codes, and ensuring logical 'Chain of Thought' reasoning in solving beam stress and footing design problems. I maintained a 98% accuracy rating across all technical validation tasks.

2025 - 2025

Education

A

Alike Dangote University of Science and Technology

Bachelor of Engineering, Structural Engineering

Bachelor of Engineering
2022

Work History

S

Selar (Freelance/Entrepreneurship)

All rounder

Suleja
2024 - Present
N

N/A

Technical Content Developer

Suleja
2023 - Present