Omer Cuvan

Senior Full-Stack Software Engineer - AI & Machine Learning

Miami, Florida, USA
$80.00/hr
Expert
Scale AI, Label Studio, Doccano

Key Skills

Software

Scale AI
Label Studio
Doccano

Top Subject Matter

No subject matter listed

Top Data Types

Text

Top Label Types

Classification
Evaluation Rating
Prompt Response Writing SFT
Computer Programming Coding

Freelancer Overview

I have over 9 years of experience as a software engineer specializing in Python-based systems, AI/ML integrations, and the validation of algorithmic outputs. My work has focused on evaluating and improving AI-generated solutions by validating numerical results with tools like NumPy, SciPy, and Pandas, and providing structured technical feedback to refine model behavior. I have hands-on experience in prompt refinement, code review, and establishing scoring criteria for multi-step problem-solving, ensuring that AI outputs align with first-principles reasoning and accepted standards. My background includes building and validating AI-powered features for virtual assistants, developing backend services and APIs, and supporting benchmarking and evaluation workflows in both technical and collaborative remote environments. I am skilled at ensuring data quality, annotating and validating data flows, and documenting edge cases to support high-quality AI training data.

English: Expert

Labeling Experience

Label Studio

Prompt + Response Authoring for SFT Data (Task & Scheduling)

Label Studio, Text, Prompt Response Writing SFT
Wrote high-quality prompts and gold-standard responses for multi-step task workflows (scheduling, prioritization, constraints, clarifications). Ensured responses followed first-principles reasoning, used explicit assumptions, and adhered to formatting/structure requirements. Reviewed and corrected model drafts to produce consistent SFT training examples.

2023 - 2025

Scale AI

LLM Output Evaluation & Algorithmic Validation for AI Assistant

Scale AI, Text, Classification, Evaluation Rating
Evaluated AI-generated multi-step reasoning outputs for correctness, logical consistency, and adherence to first-principles engineering standards. Performed structured scoring of model responses, validated numerical and algorithmic outputs using Python (NumPy, SciPy, Pandas), and documented edge cases. Designed evaluation rubrics, conducted regression testing, and provided prompt refinement feedback to improve model reliability and reduce hallucinations. Reviewed thousands of model outputs across scheduling, task optimization, and structured problem-solving workflows. Ensured consistency between backend validation logic and model responses under real-world edge cases.

2023 - 2025

Doccano

Code & Algorithm Correctness Labeling (Backend + Logic QA)

Doccano, Text, Evaluation Rating, Computer Programming Coding
Evaluated code and algorithmic solutions for correctness, constraints, and edge-case handling. Labeled outputs as pass/fail with root-cause tags (incorrect assumptions, boundary errors, missing checks, performance issues). Verified results via unit tests and Python-based reproduction for deterministic validation.

2018 - 2025

Label Studio

Support Automation QA Labeling

Label Studio, Text, Classification, Evaluation Rating
Labeled automation outcomes for customer support workflows (tagging, routing, triggers) and validated rule behavior against expected logic. Classified failures by category (wrong routing, missing conditions, incorrect state transitions) and documented reproducible cases for engineering/QA.

2020 - 2023

Education

Atlantis University

Bachelor of Science, Computer Science

2011 - 2015

Work History

Devsinc

Senior Full-Stack Software Engineer

Miami
2023 - 2025

Help Scout

Full-Stack Software Developer

Miami
2020 - 2023