LLM Response Evaluation & RLHF Ranking Project
Contributed to large language model (LLM) training by evaluating and ranking model responses and completing supervised fine-tuning (SFT) tasks. Assessed outputs for accuracy, relevance, safety, and instruction adherence. Provided structured feedback within reinforcement learning from human feedback (RLHF) workflows to improve model alignment and response quality.