For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Y

Yossef Nagy

Data Automation Engineer (AI-powered data crawling and enrichment)

EGYPT flag
Cairo, Egypt
$40.00/hrExpert

Key Skills

Software

No software listed

Top Subject Matter

AI-powered Parsing and Data Enrichment
Recruiter Data Collection and Enrichment
Lead Generation and CRM Data Structuring

Top Data Types

TextText
DocumentDocument

Top Task Types

Data Collection

Freelancer Overview

Data Automation Engineer (AI-powered data crawling and enrichment). Brings 4+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Science, Higher Institute for Technology & Science (Arish) (2023). AI-training focus includes data types such as Text and labeling workflows including Data Collection.

ExpertEnglish

Labeling Experience

Data Automation Engineer (AI-powered data crawling and enrichment)

TextData Collection
Engineered and executed data crawling pipelines that utilized AI-powered parsing and enrichment to deliver structured datasets. Integrated OpenAI-driven enrichment and schema standardization to significantly reduce manual data cleanup. Outputs were delivered in standardized JSON/CSV formats to support analytics and lead-generation pipelines. • Designed crawlers for non-API sources, utilizing Playwright and Selenium. • Integrated proxy rotation and advanced session management for reliability. • Automated parsing and schema validation for structured, CRM-ready outputs. • Leveraged OpenAI and internal tools for enhanced data enrichment.

Engineered and executed data crawling pipelines that utilized AI-powered parsing and enrichment to deliver structured datasets. Integrated OpenAI-driven enrichment and schema standardization to significantly reduce manual data cleanup. Outputs were delivered in standardized JSON/CSV formats to support analytics and lead-generation pipelines. • Designed crawlers for non-API sources, utilizing Playwright and Selenium. • Integrated proxy rotation and advanced session management for reliability. • Automated parsing and schema validation for structured, CRM-ready outputs. • Leveraged OpenAI and internal tools for enhanced data enrichment.

2024 - Present

Data Crawler & Automation Developer (Recruiting/Candidate Sourcing Pipelines)

TextData Collection
Automated candidate data pipelines using Playwright, optimizing batch retries and ensuring high completeness. Applied schema validation and field normalization to recruiter datasets, significantly reducing preparation time. Delivered enriched candidate data in structured formats for sourcing and recruitment campaigns. • Ensured 99% data completeness and consistency in recruiter pipelines. • Provided standardized recruiter datasets in JSON/CSV formats. • Automated field normalization and schema validation for efficiency. • Enabled recruiters to identify and qualify candidates more quickly.

Automated candidate data pipelines using Playwright, optimizing batch retries and ensuring high completeness. Applied schema validation and field normalization to recruiter datasets, significantly reducing preparation time. Delivered enriched candidate data in structured formats for sourcing and recruitment campaigns. • Ensured 99% data completeness and consistency in recruiter pipelines. • Provided standardized recruiter datasets in JSON/CSV formats. • Automated field normalization and schema validation for efficiency. • Enabled recruiters to identify and qualify candidates more quickly.

2023 - 2025

Data Crawler & Automation Developer (LinkedIn Lead Generation)

TextData Collection
Developed LinkedIn lead crawlers employing LLM-assisted query generation to collect large volumes of lead data. Delivered fully structured datasets in CSV/JSON for direct CRM integration, eliminating manual formatting. Applied deduplication and rate-limit safeguards to maximize dataset quality and protect accounts. • Utilized SearXNG and LLMs for scalable lead profile discovery. • Automated dataset structuring to support outbound campaign workflows. • Reduced data duplication and manual lead cleaning by 99%. • Ensured compliance in crawling strategies to prevent account bans.

Developed LinkedIn lead crawlers employing LLM-assisted query generation to collect large volumes of lead data. Delivered fully structured datasets in CSV/JSON for direct CRM integration, eliminating manual formatting. Applied deduplication and rate-limit safeguards to maximize dataset quality and protect accounts. • Utilized SearXNG and LLMs for scalable lead profile discovery. • Automated dataset structuring to support outbound campaign workflows. • Reduced data duplication and manual lead cleaning by 99%. • Ensured compliance in crawling strategies to prevent account bans.

2023 - 2023

Education

H

Higher Institute for Technology & Science (Arish)

Bachelor of Science, Communication Engineering

Bachelor of Science
2023

Work History

N

Neurascale

Data Automation Engineer

Cairo
2024 - Present
T

Talent Sourcing Automations

Data Crawler & Automation Developer

Cairo
2023 - 2025