For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Edwen Primrose

Edwen Primrose

LLM Evaluation and Audio Transcriber Specialist in English & Mandarin

USA flagMARYLAND, Usa
$20.00/hrExpertAppenClickworkerCrowdsource

Key Skills

Software

AppenAppen
ClickworkerClickworker
CrowdSourceCrowdSource
Data Annotation TechData Annotation Tech
LabelboxLabelbox
LabelImgLabelImg
Scale AIScale AI

Top Subject Matter

No subject matter listed

Top Data Types

AudioAudio
ImageImage
TextText

Top Task Types

Audio Recording
Computer Programming Coding
Prompt Response Writing SFT
RLHF
Text Summarization

Freelancer Overview

Edwen Primrose is an experienced AI Data Evaluator and Physics Specialist with over 10 years of research and analytical expertise. He has contributed to AI model training and data labeling projects focused on improving reasoning accuracy and conceptual consistency in technical subjects. With advanced proficiency in Python, R, and Stata, Edwen applies strong quantitative and critical thinking skills to evaluate AI-generated content across physics, econometrics, and behavioral modeling. His PhD-level background and keen attention to detail enable him to provide high-quality feedback that strengthens machine learning outputs, ensuring precision, depth, and reliability in artificial intelligence systems.

ExpertHindiEnglishSpanishChinese Mandarin

Labeling Experience

Labelbox

Advanced Video Data Annotation for Surgical AI and LLM Integration

LabelboxVideoEntity Ner ClassificationClassification
This project involved annotating a large-scale dataset of minimally invasive surgical videos to train and fine-tune a next-generation AI model. The core objective was twofold: first, to enable real-time surgical instrument and anatomy detection for computer-assisted intervention; and second, to create a structured dataset for a specialized Healthcare LLM to generate accurate surgical reports and post-operative summaries. Specific Data Labeling Tasks Performed: Instrument Detection & Tracking: Drew precise bounding boxes around surgical instruments (e.g., scalpels, forceps, clamps) in every frame, ensuring consistent tracking IDs to follow their movement and usage throughout procedures. Anatomy Segmentation: Used polylines to meticulously trace critical anatomical structures and tissue types, defining zones of operation and potential areas of risk. Action Recognition & Classification: Tagged video segments with specific surgical actions (e.g., "grasping," "cutting," "cauterizing," "

This project involved annotating a large-scale dataset of minimally invasive surgical videos to train and fine-tune a next-generation AI model. The core objective was twofold: first, to enable real-time surgical instrument and anatomy detection for computer-assisted intervention; and second, to create a structured dataset for a specialized Healthcare LLM to generate accurate surgical reports and post-operative summaries. Specific Data Labeling Tasks Performed: Instrument Detection & Tracking: Drew precise bounding boxes around surgical instruments (e.g., scalpels, forceps, clamps) in every frame, ensuring consistent tracking IDs to follow their movement and usage throughout procedures. Anatomy Segmentation: Used polylines to meticulously trace critical anatomical structures and tissue types, defining zones of operation and potential areas of risk. Action Recognition & Classification: Tagged video segments with specific surgical actions (e.g., "grasping," "cutting," "cauterizing," "

2024 - 2024
Scale AI

AI Text Evaluation and Data Labeling for Large Language Models

Scale AITextText GenerationPrompt Response Writing SFT
Contributed to large-scale AI training and evaluation projects focused on improving the accuracy and reasoning depth of language models. Responsibilities included labeling, classifying, and evaluating AI-generated responses across scientific, technical, and behavioral domains. Reviewed text outputs for factual correctness, coherence, and adherence to guidelines, providing structured feedback to refine machine learning models. Ensured data quality through multiple review cycles, maintaining over 98% accuracy in annotation performance metrics.

Contributed to large-scale AI training and evaluation projects focused on improving the accuracy and reasoning depth of language models. Responsibilities included labeling, classifying, and evaluating AI-generated responses across scientific, technical, and behavioral domains. Reviewed text outputs for factual correctness, coherence, and adherence to guidelines, providing structured feedback to refine machine learning models. Ensured data quality through multiple review cycles, maintaining over 98% accuracy in annotation performance metrics.

2024 - 2024

Education

U

University of Chicago

PhD, Physics

PhD
2010 - 2014
U

University of Michigan, Ann Arbor

Master of Physics, Physics

Master of Physics
2010 - 2010

Work History

H

Harvard University

Associate Professor of Physics

Cambridge
2016 - 2021
B

Brookings Institution

Physics Policy Research Fellow

Washington, D.C.
2014 - 2016