Advanced Web Scraping & Data Normalization
Designed and executed complex data extraction workflows targeting JavaScript-heavy, dynamic websites. Utilized Playwright to automate browser interactions for client-side rendering, managed pagination and rate limiting, and used Pandas for rigorous data cleaning. Integrated Apify Actors and LLMs to parse unstructured HTML into strict, normalized formats for clean dataset delivery.