AI & Linguistic Data Engineer
I evaluated large-scale speech and text datasets for ASR, NLU, and TTS systems to ensure accuracy and linguistic performance. I performed detailed linguistic error and trend analysis across Marathi, Hindi, and English language datasets. I also executed SQL-driven audits to detect labeling errors and managed freelance annotator workflows.
• Evaluated and rated ASR, NLU, and TTS data for language models.
• Audited text and speech annotations for errors and inconsistencies.
• Coordinated freelance annotators and improved validation workflows.
• Analyzed multilingual data to improve model accuracy and safety.