AI-powered Document Data Extraction (Traveloo Project)
Created AI-powered document processing pipelines focused on extracting structured information from unstructured flight-related PDF documents. Designed and engineered prompt templates to convert free-form booking confirmations into well-formatted JSON using LLMs. Leveraged OpenAI API and Anthropic Claude API for advanced information extraction and automated entity recognition. • Developed robust LLM prompts to ensure consistent and reliable extraction. • Validated and structured airline and flight booking data for downstream processing. • Evaluated and improved model outputs against ground truth data. • Integrated outputs into interactive visualization tools for user consumption.