AI Prompt & Response Evaluation (Outlier — Aether)
On the AI Prompt & Response Evaluation project for Outlier, I executed multiple AI annotation tasks focusing on prompt rating and reference answer writing. The work required strong written reasoning, detailed analysis, and decisive flagging of problematic content. Consistency and accuracy were maintained throughout varied content domains. • Rated model-generated outputs for correctness and quality. • Authored reference responses for prompt benchmarking. • Identified and flagged unsafe, factually incorrect, or off-topic responses. • Demonstrated robust attention to detail and adherence to guidelines.