Josh Rothberger


Video Diarization Specialist

London, United Kingdom
$50.00/hr | Intermediate | Micro1 | Mercor | Other

Key Skills

Software

Micro1
Mercor
Other
Internal/Proprietary Tooling

Top Subject Matter

Speech and audio annotation
Phonetic and linguistic analysis
Audio analysis

Top Data Types

Audio
Video

Top Task Types

Transcription
Segmentation
Classification

Freelancer Overview

Linguist and speech data specialist with over a year of hands-on AI training data experience across transcription, annotation, and quality assurance workflows. Currently working as a Video Diarization Specialist and Post-Completion Reviewer with Micro1, annotating multi-speaker audio with speaker identity, vocal characteristics, timestamps, and emotional register for multimodal AI training, as well as reviewing and quality-checking completed work across the project. Also work as a Linguistic Consultant with SIDE Global, producing IPA-level phonetic transcriptions, ASR correction, and dialect classification across a range of English accents and varieties. Bring a strong analytical foundation from over 4,000 hours of English language teaching, including systematic phonetic error analysis and IPA-based pronunciation work. Additional background in audio production using Ableton, Melodyne and Pro Tools, with practical experience in pitch analysis and sound quality assessment. Comfortable with high-volume, quality-controlled annotation work in distributed, async team environments.

Intermediate | Thai | German | English | Spanish | Portuguese

Labeling Experience

Video Diarization Specialist

Micro1 | Audio
As a Video Diarization Specialist at Micro1, I performed detailed analysis and annotation of audio to identify speakers and segment recordings. Tasks included aligning transcript timestamps with audio frames and labeling unique voice characteristics for each participant. The role required precision in creating structured timelines for multimodal AI training data.
• Speaker diarization performed to separate and identify speakers throughout recordings.
• Annotation of vocal identity, register, and delivery patterns for AI use.
• Alignment of transcripts at fine temporal resolution for increased model accuracy.
• Segmentation and labeling of both audio and visual cues for multimodal datasets.


2025 - Present

Linguistic Consultant – US English

Other | Audio | Transcription
In my role as a Linguistic Consultant for SIDE Global, I classified accents and dialects, edited and corrected ASR outputs, and performed complex speech transcriptions. I applied linguistic annotation and technical frameworks for validating TTS alignment and prosodic features. Large volumes of annotated linguistic data were processed under rigorous quality control for NLP systems.
• Accent and dialect classification using IPA and phonetic analysis.
• ASR output editing, transcription, and documentation of error patterns.
• TTS alignment validation and speech synthesis quality assessment.
• Quality control for large-scale annotated linguistic datasets.


2024 - Present

OCR Annotation Specialist

Mercor | Video | Transcription
As an OCR Annotation Specialist for Mercor, I annotated and transcribed text within video content using Optical Character Recognition for LLM training. I ensured that all visible textual elements in short-form video were precisely transcribed and timestamped. My work followed structured guidelines to ensure high-quality multimodal datasets.
• Annotation and transcription of textual video elements for OCR tasks.
• Accurate timestamping and high consistency in data labeling.
• Structured evaluation to maintain data quality for AI training.
• Contribution to creating standardized multimodal training data.


2025

Education


International House London

Certificate in Teaching English to Speakers of Other Languages

2020

Richmond-upon-Thames College

Advanced Level Certificate, Art History, Music Technology, Film Studies

2008

Work History


Preply

English Language Teacher (Freelance)

London
2020 - Present

Commercial Music Publishing

Audio Producer (Freelance)

London
2008 - 2020