Coding Expertise Data Labeler
Project Scope & Data Labeling Tasks:
The project focused on evaluating the accuracy, coherence, and adherence of AI-generated responses from a Large Language Model (LLM). My role involved assessing responses against predefined guidelines covering instruction following, localization, truthfulness, verbosity, and clarity.

Project Size:
The project spanned thousands of AI-generated responses, each requiring meticulous review and structured feedback. I collaborated with a team of evaluators to ensure consistency across diverse language tasks and domains.

Quality Measures Adhered To:
- Instruction Following: ensuring AI responses align with explicit and implicit user instructions.
- Localization: verifying cultural and linguistic appropriateness.
- Truthfulness: fact-checking and identifying hallucinations or misinformation.
- Verbosity & Clarity: balancing response length with readability and informativeness.
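A rubric-based evaluation like the one described can be captured in a small data structure. The sketch below is purely illustrative (all names, the 1-5 scale, and the consistency tolerance are hypothetical assumptions, not tooling actually used on the project); it shows per-criterion scoring for a response and a simple inter-evaluator consistency check:

```python
from dataclasses import dataclass, field

# Hypothetical criteria mirroring the quality measures listed above.
CRITERIA = ("instruction_following", "localization",
            "truthfulness", "verbosity", "clarity")

@dataclass
class Evaluation:
    """One evaluator's structured feedback on a single AI response."""
    response_id: str
    scores: dict = field(default_factory=dict)  # criterion -> 1-5 rating
    notes: str = ""

    def overall(self) -> float:
        # Unweighted mean across all rated criteria.
        return sum(self.scores.values()) / len(self.scores)

def consistent(evals, tolerance=1.0):
    """Assumed consistency check: overall scores within `tolerance` of each other."""
    overalls = [e.overall() for e in evals]
    return max(overalls) - min(overalls) <= tolerance

# Two evaluators rating the same response:
a = Evaluation("resp-001", {c: 4 for c in CRITERIA}, notes="minor verbosity")
b = Evaluation("resp-001", {c: 5 for c in CRITERIA})
print(consistent([a, b]))  # True: mean scores of 4.0 and 5.0 fall within tolerance
```

In practice a weighted mean or per-criterion agreement thresholds could replace the unweighted average, depending on which quality measures the guidelines prioritize.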