Scientific Literature & Chemical Data Annotation for AI/ML Applications
Leveraged deep domain expertise in organic and medicinal chemistry to annotate and label scientific text datasets intended for training chemistry-focused AI/ML models. Tasks included named entity recognition (NER) of chemical compounds, reaction types, functional groups, and pharmacological targets within peer-reviewed literature and patent documents. Applied subject matter expertise to classify structure-activity relationships (SAR), label reaction mechanisms, and evaluate the accuracy of model-generated chemical summaries and synthesis routes. Ensured high annotation quality through thorough review of chemical nomenclature, IUPAC naming conventions, and pharmaceutical terminology aligned with drug discovery workflows.