For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
D

Daniel Kadosh

Computer Science Evaluation Task Development - Data Labeler

USA flagDallas, Usa
$70.00/hrIntermediate

Key Skills

Software

No software listed

Top Subject Matter

Computer Science
AI Reasoning
Artificial Intelligence

Top Data Types

ImageImage
TextText

Top Task Types

RLHF
Fine Tuning
Computer Programming Coding

Freelancer Overview

Computer Science Evaluation Task Development - Data Labeler. Brings 1+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Doctor of Philosophy, University of Texas - Dallas (2025) and Master of Science, University of Texas - Dallas (2025). AI-training focus includes data types such as Image and Text and labeling workflows including Evaluation and Rating.

IntermediateEnglishHebrew

Labeling Experience

AI Benchmark Design and Model Evaluation - Data Labeler

Text
I contributed to the creation of multimodal and text-only AI benchmarks for model evaluation at Handshake AI. My responsibilities included designing, labeling, and analyzing difficult reasoning tasks and reviewing AI outputs for accuracy. I documented failure cases to support robust benchmarking and continuous improvement. • Designed and labeled multimodal and text-based reasoning tasks. • Evaluated model responses for completeness and validity. • Documented step-by-step solutions and outcome justifications. • Identified gaps in model understanding and reported issues.

I contributed to the creation of multimodal and text-only AI benchmarks for model evaluation at Handshake AI. My responsibilities included designing, labeling, and analyzing difficult reasoning tasks and reviewing AI outputs for accuracy. I documented failure cases to support robust benchmarking and continuous improvement. • Designed and labeled multimodal and text-based reasoning tasks. • Evaluated model responses for completeness and validity. • Documented step-by-step solutions and outcome justifications. • Identified gaps in model understanding and reported issues.

2025 - Present

Computer Science Evaluation Task Development - Data Labeler

Image
As an independent researcher at Handshake AI, I designed and evaluated benchmark tasks that challenged AI models to reason about complex visual concepts. My work involved labeling and categorizing image-based tasks and reviewing AI-generated outputs for accuracy and integrity. I ensured that benchmarks included both objective prompts and solutions for rigorous visual interpretation testing. • Labeled and evaluated images for computer science benchmarks. • Wrote prompts, answers, and justifications to test visual reasoning. • Reviewed model outputs and corrected inaccuracies. • Assessed AI failures and documented solutions.

As an independent researcher at Handshake AI, I designed and evaluated benchmark tasks that challenged AI models to reason about complex visual concepts. My work involved labeling and categorizing image-based tasks and reviewing AI-generated outputs for accuracy and integrity. I ensured that benchmarks included both objective prompts and solutions for rigorous visual interpretation testing. • Labeled and evaluated images for computer science benchmarks. • Wrote prompts, answers, and justifications to test visual reasoning. • Reviewed model outputs and corrected inaccuracies. • Assessed AI failures and documented solutions.

2025 - Present

Education

U

University of Texas - Dallas

Master of Science, Computer Science

Master of Science
2025 - 2025
U

University of Texas - Dallas

Bachelor of Science, Computer Science

Bachelor of Science
2022 - 2024

Work History

H

Handshake AI

Independent Researcher

Dallas
2025 - Present