Gen-AI Intern, LLM Fine-tuning
I fine-tuned a large language model (LLM) with parameter-efficient fine-tuning (PEFT), specifically QLoRA, on custom datasets built for internal question answering (QA). The work covered data preparation, model adaptation, and performance benchmarking, and it advanced the team's internal LLM capabilities for domain-specific QA.
• Fine-tuned a language model with QLoRA, a parameter-efficient fine-tuning technique.
• Used Hugging Face Transformers, TRL (SFTTrainer), and bitsandbytes for quantized, memory-efficient training.
• Created and managed domain-specific training data for supervised fine-tuning.
• Benchmarked the adapted model on internal QA tasks to assess its effectiveness.