For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
C

Carren Kirwa

Scientific Data Annotation Specialist — Handshake

United Kingdom flagCambridge, United Kingdom
ExpertCVATScale AIRemotasks

Key Skills

Software

CVATCVAT
Scale AIScale AI
RemotasksRemotasks
AppenAppen
LabelboxLabelbox

Top Subject Matter

STEM (math, physics, chemistry, statistics) and industrial manufacturing images
Stem Domain Expertise
data science

Top Data Types

ImageImage
TextText
AudioAudio
VideoVideo
DocumentDocument

Top Task Types

Bounding BoxBounding Box
Prompt + Response Writing (SFT)Prompt + Response Writing (SFT)
TranscriptionTranscription

Freelancer Overview

Scientific Data Annotation Specialist — Handshake. Brings 9+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include CVAT, Outlier, and Scale AI. Education includes Doctor of Philosophy, University of Lincoln (2021) and Master of Science, University of Sunderland (2020). AI-training focus includes data types such as Image, Text, and Audio and labeling workflows including Bounding Box, Evaluation, and Rating.

Expert

Labeling Experience

CVAT

Scientific Data Annotation Specialist — Handshake

CVATImageBounding Box
Reviewed and corrected auto-annotated frames from industrial and manufacturing image datasets using CVAT, focusing on process plant and refinery environments. Designed and validated STEM evaluation scenarios for mathematics, chemistry, thermodynamics, quantum mechanics, and statistics content. Authored and verified SFT prompt-and-response pairs in STEM and data science, and performed quality rating of GenAI outputs across multiple domains. • Applied bounding box annotation and classification labels using CVAT • Performed quality control and QA reviews of large-scale image datasets • Validated complex, multi-step mathematical and statistical reasoning in AI-generated solutions • Conducted red-team exercises for hallucination detection and error documentation

Reviewed and corrected auto-annotated frames from industrial and manufacturing image datasets using CVAT, focusing on process plant and refinery environments. Designed and validated STEM evaluation scenarios for mathematics, chemistry, thermodynamics, quantum mechanics, and statistics content. Authored and verified SFT prompt-and-response pairs in STEM and data science, and performed quality rating of GenAI outputs across multiple domains. • Applied bounding box annotation and classification labels using CVAT • Performed quality control and QA reviews of large-scale image datasets • Validated complex, multi-step mathematical and statistical reasoning in AI-generated solutions • Conducted red-team exercises for hallucination detection and error documentation

2026 - Present

STEM Domain Expert Evaluator — Outlier & TELUS Digital AI Community

Text
Assessed AI-generated text outputs for advanced STEM, data science, and ML tasks including equation solving, machine learning evaluation, and data science workflows. Performed side-by-side preference ranking and detailed rationale reporting for scientific accuracy and code correctness. Designed and validated computational statistics and programming problems, and authored multi-turn tutoring dialogues. • Evaluated GenAI outputs, including ranking and structured scoring • Validated solutions using Python-based workflows for reproducibility • Designed original analytical content and scenarios • Identified error patterns and provided structured feedback for model training

Assessed AI-generated text outputs for advanced STEM, data science, and ML tasks including equation solving, machine learning evaluation, and data science workflows. Performed side-by-side preference ranking and detailed rationale reporting for scientific accuracy and code correctness. Designed and validated computational statistics and programming problems, and authored multi-turn tutoring dialogues. • Evaluated GenAI outputs, including ranking and structured scoring • Validated solutions using Python-based workflows for reproducibility • Designed original analytical content and scenarios • Identified error patterns and provided structured feedback for model training

2023 - 2025
Scale AI

Annotation QA Lead & Guideline Author — Scale AI & Appen

Scale AIText
Led QA and guideline authorship for annotation projects across STEM, data science, ML, computer science, biology, and general domains. Designed gold-standard, honeypot, and calibration tasks while monitoring inter-annotator agreement and retraining needs. Executed QA sweeps of annotation files and collaborated with project leads to define rubric-based evaluation. • Wrote guidelines, appendices, and decision trees for annotation clarity • Designed calibration and gold-standard tasks for scientific content • Executed quality assurance sweeps using Python scripts • Reported throughput and quality metrics to platform leads

Led QA and guideline authorship for annotation projects across STEM, data science, ML, computer science, biology, and general domains. Designed gold-standard, honeypot, and calibration tasks while monitoring inter-annotator agreement and retraining needs. Executed QA sweeps of annotation files and collaborated with project leads to define rubric-based evaluation. • Wrote guidelines, appendices, and decision trees for annotation clarity • Designed calibration and gold-standard tasks for scientific content • Executed quality assurance sweeps using Python scripts • Reported throughput and quality metrics to platform leads

2022 - 2024
Appen

Speech & Text Data Annotator — Remote (Contract)

AppenAudioTranscription
Transcribed audio recordings, performed speaker diarization with speaker IDs, and annotated timestamps for non-speech events. Labeled text and chat logs for sentiment, toxicity, named entities, and relationships to train NER models. Reviewed ASR model outputs against reference transcripts to categorise error types for targeted model improvements. • Applied intent, topic, and toxicity labels to text and chat logs • Annotated named entities and conducted relationship labeling for NER • Conducted transcription and speaker diarization for audio data • Categorised and flagged ASR errors for QA and model improvement

Transcribed audio recordings, performed speaker diarization with speaker IDs, and annotated timestamps for non-speech events. Labeled text and chat logs for sentiment, toxicity, named entities, and relationships to train NER models. Reviewed ASR model outputs against reference transcripts to categorise error types for targeted model improvements. • Applied intent, topic, and toxicity labels to text and chat logs • Annotated named entities and conducted relationship labeling for NER • Conducted transcription and speaker diarization for audio data • Categorised and flagged ASR errors for QA and model improvement

2020 - 2023
Remotasks

AI Trainer & Data Labeler — Remotasks & DataAnnotation.tech

RemotasksTextPrompt Response Writing SFT
Authored SFT prompt-and-response pairs in Python, data science, statistical analysis, ML, SQL, math, and scientific reasoning domains. Performed image-based QA on scientific figures and label verification, and designed computational data science problems for validation. Conducted red-team and adversarial prompt tasks across STEM and data science scenarios. • Wrote multi-turn dialogue prompts for STEM tutoring contexts • Validated and flagged unanswerable or ambiguous examples • Ensured deterministic and reproducible problem solutions • Documented adversarial output reproduction for modeller review

Authored SFT prompt-and-response pairs in Python, data science, statistical analysis, ML, SQL, math, and scientific reasoning domains. Performed image-based QA on scientific figures and label verification, and designed computational data science problems for validation. Conducted red-team and adversarial prompt tasks across STEM and data science scenarios. • Wrote multi-turn dialogue prompts for STEM tutoring contexts • Validated and flagged unanswerable or ambiguous examples • Ensured deterministic and reproducible problem solutions • Documented adversarial output reproduction for modeller review

2021 - 2022

Education

U

University of Sunderland

Master of Science, Pharmaceutical and Chemical Sciences

Master of Science
2019 - 2020
U

University of Nairobi

Bachelor of Science, Mathematics, Physics and Chemistry

Bachelor of Science
2015 - 2018

Work History

A

Astrazeneca

Senior Computational Scientist

Cambridge
2023 - Present
J

Johnson Matthey

Research Scientist – Chemical Modelling

London
2021 - 2022