Invoice Data Extraction and Classification Pipeline
Automated extraction and classification of invoice data using OCR and LLM pipelines. Labeled and categorized financial documents based on the processed data to streamline accounting operations. Improved the accuracy of document categorization by validating and refining AI pipeline outputs.
• Used Mistral OCR and OpenAI GPT to extract and classify document data.
• Reviewed and corrected document labels for accounting categories.
• Provided feedback to improve model accuracy and efficiency.
• Applied financial and accounting subject-matter knowledge to refine the training set.