For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Sulaiman Alatwah

Sulaiman Alatwah

NLP & LLM Researcher - AI and Data Science

SAUDI_ARABIA flag
dammam, Saudi Arabia
$30.00/hrIntermediateOther

Key Skills

Software

Other

Top Subject Matter

No subject matter listed

Top Data Types

AudioAudio

Top Label Types

Fine Tuning
Data Collection

Freelancer Overview

I am a fresh graduate specializing in Artificial Intelligence, with hands-on experience in data annotation, dataset development, and preparing high-quality training data for machine learning models. During my internship as an NLP & LLM researcher, I contributed to the Dialect Regression Project by systematically collecting and annotating audio data from diverse Arabic dialects, ensuring fair representation across regions and genders. My work involved speech preprocessing, feature extraction, and data labeling to support dialect classification and speech recognition tasks. I have also built AI systems for tasks like phishing detection using multilingual text analysis and image-based logo detection, applying frameworks such as mBERT and Gemini Vision API. My strong attention to detail, technical skills in Python, and experience with both text and audio data make me well-equipped to support data labeling and AI training data initiatives.

IntermediateEnglishArabic

Labeling Experience

dialect recognition

OtherImageObject DetectionAction Recognition
š€š‘š‚š€šƒš„ enables city-level Arabic dialect modeling from real radio audio, moving beyond coarse country labels. šŸ“Œ š–š”ššš­ š°šž šššœš”š¢šžšÆšžš: - 3,790 radio clips (~31.6 hours) - 6,907 expert annotations - 58 cities across 19 countries - Labeled by 11 š­š«ššš¢š§šžš š§ššš­š¢šÆšž ššš§š§šØš­ššš­šØš«š¬ with multi-review quality control! We hope this resource will š¬šžš«šÆšž š­š”šž š€š«ššš›š¢šœ š¬š©šžšžšœš” & šš‹š šœšØš¦š¦š®š§š¢š­š², accelerate reproducible research, and support stronger dialect-aware technologies.

š€š‘š‚š€šƒš„ enables city-level Arabic dialect modeling from real radio audio, moving beyond coarse country labels. šŸ“Œ š–š”ššš­ š°šž šššœš”š¢šžšÆšžš: - 3,790 radio clips (~31.6 hours) - 6,907 expert annotations - 58 cities across 19 countries - Labeled by 11 š­š«ššš¢š§šžš š§ššš­š¢šÆšž ššš§š§šØš­ššš­šØš«š¬ with multi-review quality control! We hope this resource will š¬šžš«šÆšž š­š”šž š€š«ššš›š¢šœ š¬š©šžšžšœš” & šš‹š šœšØš¦š¦š®š§š¢š­š², accelerate reproducible research, and support stronger dialect-aware technologies.

2025 - 2025

dialect recognition

OtherAudioFine TuningData Collection
š€š‘š‚š€šƒš„ enables city-level Arabic dialect modeling from real radio audio, moving beyond coarse country labels. šŸ“Œ š–š”ššš­ š°šž šššœš”š¢šžšÆšžš: - 3,790 radio clips (~31.6 hours) - 6,907 expert annotations - 58 cities across 19 countries - Labeled by 11 š­š«ššš¢š§šžš š§ššš­š¢šÆšž ššš§š§šØš­ššš­šØš«š¬ with multi-review quality control! We hope this resource will š¬šžš«šÆšž š­š”šž š€š«ššš›š¢šœ š¬š©šžšžšœš” & šš‹š šœšØš¦š¦š®š§š¢š­š², accelerate reproducible research, and support stronger dialect-aware technologies.

š€š‘š‚š€šƒš„ enables city-level Arabic dialect modeling from real radio audio, moving beyond coarse country labels. šŸ“Œ š–š”ššš­ š°šž šššœš”š¢šžšÆšžš: - 3,790 radio clips (~31.6 hours) - 6,907 expert annotations - 58 cities across 19 countries - Labeled by 11 š­š«ššš¢š§šžš š§ššš­š¢šÆšž ššš§š§šØš­ššš­šØš«š¬ with multi-review quality control! We hope this resource will š¬šžš«šÆšž š­š”šž š€š«ššš›š¢šœ š¬š©šžšžšœš” & šš‹š šœšØš¦š¦š®š§š¢š­š², accelerate reproducible research, and support stronger dialect-aware technologies.

2025 - 2025

Education

T

Tuwaiq Academy

Diploma, AI and Data Science

Diploma
2024 - 2025
J

Jubail Industrial College

Bachelor of Science, Computer Science

Bachelor of Science
2019 - 2024

Work History

W

White Dome

Administrative Assistant

Dammam
2019 - Present
R

RIOTU Lab Prince Sultan University

NLP & LLM Researcher

Dammam
2025 - 2025