AI Model Training Associate
As an AI Model Training Associate, I curated, cleaned, and annotated over 50,000 data samples for supervised learning in NLP and computer vision tasks. I fine-tuned large language models using both LoRA and full fine-tuning methods. My work included implementing RLHF pipelines and human evaluation rubrics. • Labeled both text and image data for NLP (such as NER, sentiment) and vision tasks • Used Label Studio and CVAT for data annotation processes • Applied BLEU, ROUGE, and human evaluations for quality control • Collaborated remotely with research teams and documented all training experiments