Cypher-RLHF
One of my key projects involved writing intentionally flawed responses about Korean traditional culture, history, transportation, and geography to pinpoint the specific areas where AI models failed. By analyzing these failures, I provided feedback used to improve model performance. I also reviewed RLHF evaluations performed by others, checking the quality and completeness of their reviews. This work required a meticulous approach to catch shortcomings in the evaluations and keep them aligned with RLHF goals.