Speaker Diarization
The project aims to transcribe audio files to support the development of a cutting-edge automatic speech recognition model. Participants are responsible for transcribing and performing speaker diarization on audio files up to five minutes long. This involves either improving existing pre-transcriptions or creating new transcriptions from scratch. Additionally, non-speech tags must be provided for sounds occurring simultaneously with speech, such as non-word pronunciations. A key aspect of the project is the precise timestamping of audio to mark continuous speech, defined as speech with pauses of less than 0.5 seconds. Participants must also track and identify speakers by adding timestamps at the beginning and end of each speaker change. To maintain project participation, a minimum commitment of 10 hours per week is required, with an accuracy rate of at least 90%. Quality assurance processes involve reviewing an average of 5% of the work to ensure adherence to these standards.