Freelance AI Data Strategist & Developer
Developed custom pipelines for extracting code and generating instruction-response pairs for LLM fine-tuning. Engineered quality-filtering scripts and leveraged automation tools for high-volume data compilation. Ensured dataset readiness for specialized chatbot models.
• Built data pipelines with OpenRouter for chatbot training data.
• Extracted code from GitHub using automated scrapers.
• Generated more than 5,000 high-quality instruction pairs.
• Implemented deduplication and scoring scripts for QA.