Software Engineer, AI Annotator – Contractor
As a Software Engineer, AI Annotator contractor at Outlier AI, I evaluated the performance of AI models and agents. I assessed outputs for instruction-following, truthfulness, and completeness, and provided critical feedback to improve system accuracy. I also analyzed AI agent coding solutions for open-source repositories in JavaScript, Python, and TypeScript.
• Rated AI model responses on multiple metrics, including instruction-following and completeness
• Evaluated outputs of AI agents such as Grok and Claude for coding solution quality
• Used web app interfaces and remote collaboration tools for annotation and evaluation tasks
• Contributed to the improvement and tuning of AI systems for better model accuracy