Data Annotator
1. Multimodal STEM reasoning style questions that stump state of the art thinking models. 2. GTFA reasoning style question that challenges the boundary of best LLM models. 3. STEM Rating on Outlier is back! Evaluate the quality of two model responses to an everyday user's request and the compare versus SOTA models.