For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Mustafa Elaraby

Mustafa Elaraby

LLM Evaluation and Text Generation Specialist in English & Arabic & German

Egypt flagElminoufia, Egypt
$20.00/hrExpertAws SagemakerAppenArgilla

Key Skills

Software

AWS SageMakerAWS SageMaker
AppenAppen
ArgillaArgilla
Data Annotation TechData Annotation Tech
RemotasksRemotasks
Scale AIScale AI
TolokaToloka

Top Subject Matter

No subject matter listed

Top Data Types

Computer Code ProgrammingComputer Code Programming
ImageImage
TextText

Top Task Types

Classification
Computer Programming Coding
Evaluation Rating
RLHF
Text Generation

Freelancer Overview

As an experienced Embedded Systems Instructor, Software Developer, and AI Specialist, I have a strong background in C, C++, Python, and Microcontroller programming. I have trained students and professionals in Embedded C, AUTOSAR, and microprocessor interfacing (AVR, ARM). Additionally, I have hands-on experience in Large Language Model (LLM) evaluation, assessing AI-generated responses for instruction adherence, localization, truthfulness, verbosity, and clarity. My work in LLM evaluation ensures AI models produce high-quality, contextually accurate outputs. With expertise in CUDA programming, AI model optimization, and software development, I contribute to cutting-edge research and industry projects. Passionate about technological advancements, I continuously enhance my skills to stay at the forefront of Embedded Systems, AI, and LLM evaluation.

ExpertArabicGermanEnglish

Labeling Experience

Scale AI

Coding Expertise Data labeler

Scale AIComputer Code ProgrammingClassification
Project Scope & Data Labeling Tasks: The project focused on evaluating the accuracy, coherence, and adherence of AI-generated responses from a Large Language Model (LLM). My role involved assessing responses based on predefined guidelines, including instruction following, localization, truthfulness, verbosity, and clarity. Project Size: The project spanned thousands of AI-generated responses, requiring meticulous review and structured feedback. I collaborated with a team of evaluators to ensure consistency across diverse language tasks and domains. Quality Measures Adhered To: Instruction Following: Ensuring AI responses align with explicit and implicit user instructions. Localization: Verifying cultural and linguistic appropriateness. Truthfulness: Fact-checking and identifying hallucinations or misinformation. Verbosity & Clarity: Balancing response length with readability and informativeness.

Project Scope & Data Labeling Tasks: The project focused on evaluating the accuracy, coherence, and adherence of AI-generated responses from a Large Language Model (LLM). My role involved assessing responses based on predefined guidelines, including instruction following, localization, truthfulness, verbosity, and clarity. Project Size: The project spanned thousands of AI-generated responses, requiring meticulous review and structured feedback. I collaborated with a team of evaluators to ensure consistency across diverse language tasks and domains. Quality Measures Adhered To: Instruction Following: Ensuring AI responses align with explicit and implicit user instructions. Localization: Verifying cultural and linguistic appropriateness. Truthfulness: Fact-checking and identifying hallucinations or misinformation. Verbosity & Clarity: Balancing response length with readability and informativeness.

2022 - 2024

Education

Z

Zewail University of Sciences and Technology

Bachelor's, Aerospace Engineering

Bachelor's
2017 - 2022

Work History

D

DroneLeaf

AI Engineer

Cairo
2022 - Present