Project Sheldon
We were working on improving LLaMA by generating very challenging prompts with a single verifiable answer. The goal was to fail the model at least 5 times, if does not fail times you twist the prompt further and make it harder for the model to provide an accurate answer. This was meant to make the model be able to solve very hard and critical questions.