Language Preference Writing and Rewrite - RLHF
This project consisted in initiating queries based on specific domains, and evaluating the two responses generated by the two models. Some tasks included 2 to 4 turns, whereby the contributor had to initiate the number of queries that were required and respect the domains and ensure a logical follow up of the queries. The chosen responses were eventually rewritten and justification were provided to support the judgement that was made.