Evaluation
I am a reviewer on the Evals project at Scale AI The project remains active across multiple countries My role involves evaluating pairs of AI responses based on: Truthfulness Accuracy Overall quality Formatting Harmfulness / safety considerations