AI Trainer / Code Reviewer
As an AI Trainer and Code Reviewer at Outlier, I specialize in high-level data labeling: auditing and optimizing code generated by various state-of-the-art (SOTA) models. My work focuses on advanced model alignment using RLHF and RLVR protocols to improve complex algorithmic generation and verifiable reasoning. To keep the work at production-grade quality, I implement the Model Context Protocol (MCP) and tool calling to connect AI models with external API ecosystems.