Omar Nagy

AI Annotator & Evaluator | Experienced in Text, Code & Reasoning Tasks

Cairo, Egypt
$30.00/hr | Intermediate | Appen, Clickworker, Labelbox

Key Skills

Software

Appen
Clickworker
Labelbox
OneForma
Remotasks
Scale AI
SuperAnnotate
Toloka

Top Subject Matter

No subject matter listed

Top Data Types

Computer Code Programming
Image
Text

Top Task Types

Computer Programming Coding
Function Calling
Red Teaming
RLHF
Text Generation

Freelancer Overview

For the past two years I’ve specialised in human-in-the-loop training of large language models, supplying more than 6,000 high-quality feedback samples that power code assistants and chatbots. My core work is RLHF: ranking and critiquing model-generated code, diagnosing compiler and runtime errors, and crafting gold-standard fixes, using pipelines on Scale AI, Toloka and Labelbox. These datasets feed reward models that raise functional accuracy, reduce bias, and harden models against unsafe outputs. In parallel, I lead multilingual prompt QA (English ↔ Arabic) and machine-translation post-editing. My safety focus includes extensive red-teaming and jailbreak testing: simulating adversarial prompts that expose bias or policy leaks before deployment.

Intermediate | Arabic, English

Labeling Experience

Labelbox

Function-Call Trace Annotation for Conversational Agents

Labelbox | Computer Code Programming | Evaluation Rating | Function Calling
Labeled 500+ tool-use chains (arguments, returns, error states) that drive fine-tuning and regression tests for API-enabled chat agents.

2025 - 2025
Appen

Multilingual MTPE & Safety QA

Appen | Text | Text Generation | Translation Localization
Post-edited 100k+ words (EN↔AR) in MTPE workflows and ran content-safety sweeps, cutting the critical error rate below 2%.

2025 - 2025
OneForma

Bilingual Prompt QC & MT Post-Editing

OneForma | Text | Translation Localization | Evaluation Rating
Audited Arabic/English prompts for tone and cultural fit; produced the style guide now used across OneForma’s ISAAC translation projects.

2024 - 2025
Scale AI

RLHF Code-Assistant Dataset

Scale AI | Computer Code Programming | RLHF | Evaluation Rating
Ranked and critiqued 900+ code generations, authored reference solutions and unit tests, and annotated JSON function-call schemas.

2023 - 2025

Education

Suez Canal University

Bachelor's Degree, Computers And Informatics

2014 - 2018

Work History

NeuraScale

Principal AI Architect (Consultant)

Remote
2024 - 2025

Alpine Laser

Senior WordPress & Automation Consultant

Remote
2023 - 2024