Senior AI Evaluation Analyst (Remote)
Independent Contractor | 2022 – Present Reviewed and evaluated 40,000+ AI-generated outputs across multiple domains including text and visual datasets. Applied structured evaluation metrics for relevance, factual correctness, and reasoning quality. Designed labeling guidelines and scoring rubrics to ensure consistent annotation across teams. Maintained >99% accuracy while working on high-volume annotation tasks. Documented recurring model errors and dataset inconsistencies to improve AI training data.