AI Output Evaluator
As an AI Output Evaluator, I assessed the accuracy, safety, and coherence of AI-generated responses across diverse text tasks. I consistently applied structured rubrics to evaluate outputs and provided written feedback to support model improvement. My work focused on rating reasoning, summarization, mathematics, and code-related responses across multiple platforms.
• Scored outputs for correctness, factuality, and adherence to instructions using platform-provided guidelines.
• Consistently flagged hallucinated facts, logical inconsistencies, and unsafe content.
• Ranked model outputs by preference and delivered justifications grounded in evidence.
• Maintained high inter-rater agreement and completed calibration tasks for quality control.