Machine Learning Data Associate
Validated annotations and built automated tests, including unit tests and scripts that detect inconsistent or invalid labels. Preprocessed and cleaned large-scale text, image, and multimodal datasets using Python and SQL to produce consistent, model-ready inputs for AI training.