LLM Post-Training Intern
As an LLM Post-Training Intern at ETHARA AI (Green Rider Technology LLP), I took part in post-training Large Language Models using Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF). My responsibilities included data annotation, model evaluation, and prompt refinement to improve model accuracy. I focused on the quality and confidentiality of training data while working closely with the AI team.
• Annotated data for LLMs, refining prompts and responses.
• Evaluated models trained with SFT and RLHF to help optimize performance.
• Checked and maintained data quality, adhering to strict ethical and confidentiality standards.
• Collaborated with team members to implement feedback and improve AI training procedures.