Annotator / Model Tester / Annotator
As a freelance Annotator and Model Tester for Mercor, I created and tested hundreds of prompts to benchmark and refine AI model outputs. I maintained high accuracy and throughput in large-scale annotation workflows. My work supported WideSearch and BrowseComp benchmarking projects with a focus on quality and consistency. • Developed effective prompts for LLM evaluation • Collaborated in refining AI behavior • Performed large-scale annotation tasks • Maintained excellent accuracy metrics