AI Agent Evaluation Analyst / Data Verification Specialist
Evaluated AI-generated outputs for logical consistency, factual accuracy, and safety compliance as part of AI agent workflow assessments. Applied Reinforcement Learning from Human Feedback (RLHF) methodologies to enhance AI reliability. Developed standardized benchmark responses to help troubleshoot multi-step workflows in automated systems.
• Assessed generated text outputs for compliance and accuracy.
• Created benchmarks for AI output evaluation and error reduction.
• Provided feedback on subtle workflow and logic gaps within agent systems.
• Used internal tools to audit and document process optimizations.