For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Mohamed Maged

Mohamed Maged

LLM Evaluation and Prompt Generation Specialist in English & Arabic

Egypt flagCairo, Egypt
$15.00/hrIntermediateLabelboxOneformaRemotasks

Key Skills

Software

LabelboxLabelbox
OneFormaOneForma
RemotasksRemotasks
Scale AIScale AI

Top Subject Matter

No subject matter listed

Top Data Types

AudioAudio
ImageImage
TextText

Top Task Types

Bounding Box
Classification
Evaluation Rating
Prompt Response Writing SFT
Red Teaming

Freelancer Overview

Experienced Data Annotator with 2+ years in AI training and data labeling. Skilled in NLP, text analysis, and dataset evaluation with a strong focus on accuracy and consistency. Proven ability to deliver high-quality training data to support AI and machine learning projects.

IntermediateArabicEnglish

Labeling Experience

Scale AI

Omani Project

Scale AITextEvaluation Rating
A project called Odyssey has more than one phase. In the first phase, we write the prompt based on specific criteria, such as whether the prompt should include someone working in the Omani ministry. In the second phase, these prompts move to evaluation. There are more than one type: there is a translation project, a copywriting project, and another for creative content.

A project called Odyssey has more than one phase. In the first phase, we write the prompt based on specific criteria, such as whether the prompt should include someone working in the Omani ministry. In the second phase, these prompts move to evaluation. There are more than one type: there is a translation project, a copywriting project, and another for creative content.

2025
Scale AI

Convo Mode Project

Scale AIAudioEvaluation Rating
In this project, we are testing two artificial intelligence models by using conversation mode and reviewing the audio for both models. For example, in the first model, we use Gemini and in the second model, chat-gpt, and we talk to them about a specific topic and we try as much as possible to make the experience similar between the two models, and we evaluate each model separately and in the end we do the general evaluation and comparison between the two models.

In this project, we are testing two artificial intelligence models by using conversation mode and reviewing the audio for both models. For example, in the first model, we use Gemini and in the second model, chat-gpt, and we talk to them about a specific topic and we try as much as possible to make the experience similar between the two models, and we evaluate each model separately and in the end we do the general evaluation and comparison between the two models.

2024 - 2025
Scale AI

Bulba Project

Scale AITextEvaluation Rating
A big project called Bulba had many projects within it, and the basis of this project was the development of Gemini (formerly bard) in terms of many things. For example, there was a project related to factuality, and its basis was based on verifying the accuracy of the information found in the two responses, and then we would prefer a response from them.

A big project called Bulba had many projects within it, and the basis of this project was the development of Gemini (formerly bard) in terms of many things. For example, there was a project related to factuality, and its basis was based on verifying the accuracy of the information found in the two responses, and then we would prefer a response from them.

2023 - 2025

Education

A

Ain Shams University

Bachelor of Law, Law

Bachelor of Law
2022

Work History

R

RAYA CX

Costumer Service

Cairo
2021 - 2022