AI Content Contributor / Evaluator
As an AI Content Contributor and Evaluator at Outlier, Crowdgen, and Oneforma, I systematically assessed AI-generated text responses for accuracy, coherence, and safety. My work involved identifying hallucinations and logical inconsistencies and verifying adherence to complex project rubrics. I provided detailed written feedback to help model teams improve language model performance.
• Evaluated and rated responses using structured scoring rubrics.
• Analyzed outputs for factuality, reasoning errors, and compliance with nuanced guidelines.
• Performed side-by-side comparative testing and delivered actionable feedback.
• Maintained a high accuracy rate while adapting to evolving project requirements.