R&D Intern — Athena Health Pvt. Ltd.
Developed an NLP-based entity extraction system to parse scanned medical documents and populate structured databases. Retrained models with POS tagging features to improve extraction accuracy, focusing on labeling entities in textual data. Contributed to the end-to-end ML pipeline, including data preprocessing and iterative model training for improved recognition of medical entities. • Labeled entities in medical documents for named entity recognition (NER) tasks. • Enhanced entity extraction by adding POS tagging features to annotated datasets. • Utilized Python and NLP libraries to assist in annotation and preprocessing tasks. • Improved accuracy of the entity extraction model through iterative labeling and retraining.