RLHF Advanced Technical Reasoning
Contributed to alignment and fine-tuning of a frontier Large Language Model. Focused on Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF) to improve model performance in complex, multi-step reasoning tasks. Performed ranking and preference labeling on model outputs to minimize hallucinations and ensure technical factualness.