For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
S
Samar Mestiri

Samar Mestiri

AI Training/Data Collection Apprentice

Tunisia flagTunis, Tunisia
$10.00/hrIntermediateOther

Key Skills

Software

Other

Top Subject Matter

Text-to-Speech and Computer Vision for Agriculture Application
Assistive AI for Dyslexic Learners (TTS, ASR, OCR)
NLP Social Behavior Classification

Top Data Types

AudioAudio
ImageImage
TextText

Top Task Types

Data CollectionData Collection
ClassificationClassification
Text GenerationText Generation
Text SummarizationText Summarization
Fine-tuningFine-tuning
Question AnsweringQuestion Answering
TranscriptionTranscription

Freelancer Overview

AI Training/Data Collection Apprentice. Brings 2+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal, Proprietary Tooling, and Other. Education includes Bachelor of Science, ISIMM (2021). AI-training focus includes data types such as Audio, Image, and Text and labeling workflows including Data Collection and Classification.

IntermediateEnglishArabicItalianFrench

Labeling Experience

Software Engineer Intern (AI Data Preparation)

ImageData Collection
Collected and generated speech and image datasets to support AI features in a digital learning application for dyslexic users. Integrated TTS, STT, OCR, and ASR models, requiring the curation of training and validation data. Designed an interactive pronunciation game which involved creation and annotation of relevant datasets. • Gathered and labeled audio and image data for TTS and OCR modules. • Developed data pipelines for model integration in Flask API. • Focused on annotation tasks to enhance pronunciation and reading tools. • Created datasets targeting learning accessibility.

Collected and generated speech and image datasets to support AI features in a digital learning application for dyslexic users. Integrated TTS, STT, OCR, and ASR models, requiring the curation of training and validation data. Designed an interactive pronunciation game which involved creation and annotation of relevant datasets. • Gathered and labeled audio and image data for TTS and OCR modules. • Developed data pipelines for model integration in Flask API. • Focused on annotation tasks to enhance pronunciation and reading tools. • Created datasets targeting learning accessibility.

2025 - 2025

AI Training/Data Collection Apprentice

AudioData Collection
Participated in data collection, cleaning, and creation of datasets for training AI models including Text-to-Speech (TTS) and Computer Vision. Contributed to the optimization, training, and testing of these models as part of an agriculture-focused mobile application project. Employed statistical analysis and visualization techniques to generate insights from labeled data. • Labeled audio and image data for AI training purposes. • Focused on improving TTS and vision model accuracy with curated datasets. • Used FastAPI and Unreal Engine in the project pipeline. • Conducted comprehensive data cleaning and preparation for model ingestion.

Participated in data collection, cleaning, and creation of datasets for training AI models including Text-to-Speech (TTS) and Computer Vision. Contributed to the optimization, training, and testing of these models as part of an agriculture-focused mobile application project. Employed statistical analysis and visualization techniques to generate insights from labeled data. • Labeled audio and image data for AI training purposes. • Focused on improving TTS and vision model accuracy with curated datasets. • Used FastAPI and Unreal Engine in the project pipeline. • Conducted comprehensive data cleaning and preparation for model ingestion.

2025 - 2025

Personal Data Annotation Project (Good Manners Citations)

OtherTextClassification
Developed an NLP pipeline to analyze and classify over 5,000 text citations regarding good manners. Executed annotation tasks to distinguish between positive and negative social behaviors in extracted web content. Processed and validated classification labels for final data reports. • Created structured training data for social behavior classification. • Annotated citations as examples of good or bad manners. • Utilized Python-based pipelines for label validation. • Focused on data integrity and quality checks.

Developed an NLP pipeline to analyze and classify over 5,000 text citations regarding good manners. Executed annotation tasks to distinguish between positive and negative social behaviors in extracted web content. Processed and validated classification labels for final data reports. • Created structured training data for social behavior classification. • Annotated citations as examples of good or bad manners. • Utilized Python-based pipelines for label validation. • Focused on data integrity and quality checks.

Not specified

Education

I

ISIMM

Bachelor and Master of Computer Science, Software Engineering

Bachelor and Master of Computer Science
2021 - 2026

Work History

A

Africxrjob-Netinfo

Software Engineer Apprentice

N/A
2025 - 2025
R

Readdly

Software Engineer Intern

N/A
2025 - 2025