ML Research Engineer - Dataset Curation and Fine-tuning for Agent AI
As an ML Research Engineer, I curated data and fine-tuned Qwen 7B with QLoRA for the ADK (Agent Development Kit) domain. I prepared and labeled agentic coding data for model training and evaluation, with a focus on high-quality labeled datasets for code-related AI tasks.
• Led preparation of the fine-tuning dataset from coding agent tasks.
• Labeled coding traces and outputs to keep evaluation consistent.
• Incorporated mock tool execution and trace verification labels.
• Evaluated LLM-scored performance against labeled data benchmarks.