LLM post training intern
I worked as a LLM trainer mainly focusing on data annotation and response evaluation to improve a LLMs performance. I also worked on concepts such as reinforcement learning from human feedback (RLHF) and retrieval augmented generation (RAG).