LLM Safety and Quality Evaluation — Outlier Platform
Reviewed and improved model responses for general knowledge, reasoning, and writing quality. Wrote prompts, rated answers, and suggested edits to increase clarity, factual accuracy, and safety. Performed red teaming to probe for harmful or biased outputs, then documented failure cases for guideline updates. Maintained strong reviewer scores while adapting to frequent instruction changes and tight turnarounds.