LLM Evaluator & AI Response Reviewer | Outlier AI
As an LLM Evaluator & AI Response Reviewer at Outlier AI, I evaluated chatbot responses for helpfulness, correctness, and safety using structured criteria. I performed pairwise comparison tasks and flagged unsafe or biased content to improve AI performance. I ensured that all reviews were well-documented and justified per project requirements. • Rated and evaluated chatbot-generated text. • Flagged biased, toxic, or unsafe responses based on review guidelines. • Provided detailed written justifications for decisions. • Ensured reliability and safety of model outputs.