AI Evaluation Specialist (Freelance/Remote)
Evaluated AI-generated scientific and analytical text for factual accuracy, logical structure, and response quality. Applied subject matter expertise in environmental chemistry, toxicology, and risk assessment to perform nuanced output assessment. Used structured reasoning to decompose and review complex, multi-step outputs for RLHF/preference ranking workflows. • Performed human-in-the-loop quality assurance for AI text outputs • Detected and flagged subtle errors and inconsistencies inaccessible to non-experts • Employ structured evaluation frameworks inspired by research methodologies • Contributed to AI evaluation process improvements and documentation