AI Trainer / Model Evaluator (Web3-Integrated Projects)
Evaluated over 2,000 AI responses in reasoning, instruction-following, and domain-specific contexts. Performed comparative rankings and RLHF grading tasks to support model alignment and quality. Developed and optimized structured prompts to enhance response clarity. • Conducted regular identification of hallucination and safety issues • Maintained high evaluation accuracy and consistent internal quality scores • Reported on recurring task difficulties to improve process efficiency • Contributed to higher reliability in AI-generated outputs.