AI Response Evaluator / Aether
Evaluated AI-generated text responses for factual accuracy and adherence to the instructions in user prompts. Assessed outputs across multiple quality dimensions, including integrity, safety compliance, and creativity. Provided structured ratings and feedback to support language model improvement and alignment with human preferences.