Data Scientist — Health & Biomedical Analytics
Developed and processed large-scale genomic and clinical datasets to facilitate disease prediction. Utilized Python, R, and SQL to prepare and organize text-based and structured health data for machine learning model development. Applied NLP techniques to extract and annotate key clinical entities within unstructured text records for downstream analyses. • Collaborated with WHO and CDC teams on data pipeline projects. • Focused on disease and antimicrobial resistance trends in public health datasets. • Produced annotated data for use in epidemiological ML models. • Presented data findings to non-technical public health stakeholders.