Speech Recognition & Speaker Diarization Datasets
Specialized in segmenting long audio recordings, marking speaker changes, and tagging non-speech events to prepare datasets for speech recognition and noise-robust modeling.
Hire this AI Trainer
Sign in or create an account to invite AI Trainers to your job.
No subject matter listed
A versatile and detail-oriented professional with over 3 years of experience in data annotation, transcription, translation, localization, and visual data preparation across diverse datasets including audio, video, image, and text. I possess strong skills in developing accurate training data for AI systems, conducting linguistic and visual evaluations, and ensuring quality through rigorous QA workflows. My experience extends to evaluating and QAing large language models (LLMs) for conversational and generative tasks, developing rubrics for accuracy, safety, and coherence assessment, and mentoring evaluators. I've contributed to advanced annotation of audio and image datasets, created onboarding materials, and led workshops for new annotators. My expertise includes object detection, semantic segmentation, and action recognition for image and video, as well as preparing datasets for speech recognition and speaker diarization. I am adept at collaborating with global teams, adapting to evolving AI pipelines, and maintaining high data standards for precision and scalability. Additionally, I bring a unique blend of creative photography with data annotation principles for AI image training.
Specialized in segmenting long audio recordings, marking speaker changes, and tagging non-speech events to prepare datasets for speech recognition and noise-robust modeling.
Engaged in detailed annotation tasks for audio and image datasets, focusing on accuracy and consistency. This included identifying specific elements within these datasets to support AI training initiatives.
Bachelors Degree in Theatre Arts, Creative/Theatre Arts
Photography & Visual Expert