AI Response Evaluation Specialist — Independent Practice
As an AI Response Evaluation Specialist, I conducted large-scale assessment of AI-generated written content for accuracy, relevance, and instruction-following. I systematically ranked multiple AI outputs against structured rubrics, simulating RLHF and preference data collection workflows. My responsibilities included identifying errors, inconsistencies, and bias in AI responses and clearly documenting the rationale and outcome of each judgment.
• Completed hundreds of detailed evaluation tasks on diverse prompts.
• Practiced red-teaming by identifying failure modes and assessing robustness.
• Maintained structured and consistent scoring records across all tasks.
• Applied strong written communication to keep judgments clear.