Ibrahim Awny

AI-Powered Dataset Preparation Tool Developer

Cairo, Egypt
$30.00/hr | Intermediate | Scale AI

Key Skills

Software

Scale AI

Top Subject Matter

Dataset preparation for AI/ML
AI Writing Assistant for text generation and summarization
Structured data extraction from code and web sources

Top Data Types

Text
Image

Top Task Types

Classification
Text Generation
Bounding Box
Segmentation
Object Detection
Evaluation Rating
Function Calling
Prompt Response Writing (SFT)
Entity NER Classification

Freelancer Overview

AI-powered dataset preparation tool developer with 4+ years of professional experience spanning complex workflows, research, and quality-focused execution. Core strengths include internal and proprietary tooling. Holds a Bachelor of Science from Ain Shams University (2025). AI-training focus covers text, computer code, and programming content, with labeling workflows including classification and text generation.

Intermediate | English | Arabic

Labeling Experience

AI-Powered Dataset Preparation Tool Developer

Text | Classification
Built an open-source interactive dataset preparation tool that combines rule-based analysis with an AI agent to detect and fix common dataset issues. The tool lets users preview fixes and build reusable, exportable pipelines for automating data cleaning tasks. Incorporated both traditional static validation and AI-driven recommendations to ensure high-quality datasets for machine learning models.
• Automated the detection and correction of missing values, outliers, and formatting errors.
• Enhanced reliability by validating AI agent suggestions against rule-based checks.
• Supported user-driven review of all labeling and cleaning steps before export.
• Streamlined the process of preparing datasets for downstream AI training workflows.

2026 - 2026

Structured Data Extractor AI Agent Developer

Classification
Created an AI agent to extract structured data from unstructured sources, focusing on programming-oriented web pages and search results. Used dynamic schemas and validation to classify and annotate code snippets and related outputs. Ensured consistent and accurate labeling for use in downstream AI extraction and code understanding tasks.
• Labeled web data and program outputs for schema-compliant extraction.
• Employed Pydantic validation for format and consistency checking.
• Designed annotation protocols for technical and code-based content.
• Facilitated structured data training for AI code understanding systems.

2025 - 2025

NotBad: AI Writer’s Assistant Developer

Text | Text Generation
Integrated and fine-tuned AI agents for various text generation tasks such as summarization, translation, grammar checking, and next-word prediction. Built custom LSTM models requiring large-scale labeled data for supervised training. Developed user-friendly tools to streamline the labeling and model feedback process for improving LLM and sequence modeling output.
• Curated and annotated text corpora for model training and evaluation.
• Implemented iterative data labeling workflows to optimize performance.
• Validated outputs through manual and automated feedback cycles.
• Adhered to quality and consistency standards for LLM assistant outputs.

2024 - 2024

Education

Ain Shams University

Bachelor of Science, Computer Science

2021 - 2025

Work History

Kayfa

Data Scientist

Cairo
2025 - Present

Banque Misr

Data Science Intern

Cairo
2024 - 2024