Teaching AI To Be More Honest - Long Instructions
Using an external website with a specialised AI Trainer Account. This project requested a long conversation with an LLM under development (10+ turns) with at least 20+ constraints per prompt. The goal of this project was to get the LLM to fail instruction following in long conversations and with a large number of parameters. The LLM produced 2 responses per prompt, and I had to rate it along different rating axes, such as factuality, writing quality, formatting, appropriate length and which followed the instructions better. This project takes 5-7+ hours to complete fully.