RLHF Preference Ranking & Fact-Checking for LLM Alignment
In this role, I served as a Subject Matter Expert to align Large Language Models with human values of helpfulness, honesty, and harmlessness. My work directly contributed to reducing model hallucinations and improving response accuracy for complex user queries.