Remote AI Generalist
Compared AI system outputs to evaluate model performance. Utilized rubrics to rate and select the best-performing AI for each scenario. Wrote and refined prompts to enhance large language model results. • Collaborated in sustained AI evaluation through Outlier AI platform • Provided structured feedback to inform model improvement • Maintained rating consistency and adherence to project standards • Engaged in prompt design for diverse subject areas