OCR Transcription & Document Annotation for NLP Systems
Transcribed text from scanned documents and annotated key entities (names, dates, organizations). Performed data cleaning, text correction, and classification to prepare training data for an OCR-based NLP system used for automated document processing.