Claims Document Annotation & Data Extraction Project Lead
Led a data annotation project for an insurtech startup, extracting structured data from unstructured medical reports, police accident reports, and claims forms. Developed detailed guidelines for annotating and classifying critical fields, ensuring consistency and quality in the labeled data. The high-accuracy labeled data enabled the client to build a robust NLP model for automated claims processing.
• Extracted data points such as dates of service, ICD-10 codes, procedure codes, and claimant statements
• Managed annotation of over 5,000 documents
• Produced training data with 99.5% accuracy for the NLP model
• Oversaw annotation guideline design and workflow improvements