Software Engineer Intern (AI Data Preparation)
Collected and generated speech and image datasets to support AI features in a digital learning application for dyslexic users. Integrated TTS, STT, OCR, and ASR models, which required curating training and validation data. Designed an interactive pronunciation game, which involved creating and annotating the relevant datasets.
• Gathered and labeled audio and image data for the TTS and OCR modules.
• Developed data pipelines for model integration in a Flask API.
• Focused on annotation tasks to enhance the pronunciation and reading tools.
• Created datasets targeting learning accessibility.