Video & Audio Annotation for Autonomous Driving & Speech Recognition Models
In this ongoing project, I am responsible for labeling and annotating audio data for a speech recognition model used in virtual assistants and voice-driven AI applications. The primary task involves transcribing and classifying spoken language into text to train natural language processing models. I work with large volumes of audio recordings, including conversations, commands, and varied accents to ensure comprehensive language model training. The project includes the following key tasks: Audio Transcription: Converting spoken words from audio files into accurate text, ensuring punctuation, context, and proper noun recognition. Speech Recognition Annotation: Tagging different types of speech such as commands, questions, and conversational speech to train the AI to better understand natural language. Emotion & Tone Detection: Annotating speech patterns to help the model recognize and understand emotions (e.g., happy, angry, neutral). Speaker Identification: Labeling different spea