LLM-Powered Data Cleaning & Annotation Specialist (Freelance)
Developed and ran a large language model (LLM)-powered data cleaning pipeline to auto-normalize inconsistent fields in scraped text datasets. Performed automated normalization, classification, and anomaly flagging on structured and semi-structured text data. Integrated LLM-based prompt engineering workflows to standardize outputs for data quality, and reduced manual annotation workload by 70%.
• Integrated the OpenRouter API for AI-driven data cleaning
• Automated labeling to classify and normalize raw text
• Applied prompt engineering to text annotation routines
• Validated cleaned outputs for accuracy