Data Labelling
Worked on AI data labelling projects at CloudFactory, preparing high-quality datasets for training machine learning models and large language models. The work involved annotating and reviewing audio, text, and image data, with a strong emphasis on audio labelling for speech recognition, transcription, speaker identification, and audio classification. Responsibilities included applying labelling guidelines, training and supporting annotators, monitoring label quality and consistency, and improving datasets based on error analysis. This work produced accurate, reliable data and contributed to models ready for deployment and real-world use.