AI Response Evaluator (Freelance)
Evaluated AI‑generated Japanese text responses for accuracy, naturalness, coherence, and user relevance. Ranked multiple model outputs and identified issues such as ambiguity, hallucinations, and inconsistencies. Applied detailed evaluation guidelines and provided structured feedback to improve model quality. Reviewed prompts, analyzed model reasoning, and assessed linguistic clarity and tone in Japanese.