Project Raven
The project focused on evaluating LLM responses to audio prompts recorded by other workers. Each audio file had to be labeled for tone, pitch, and emotion, and the LLM response rated accordingly. The project comprised more than 5k tasks and lasted just under a month.