AI Training Specialist & RLHF Analyst
As an AI Training Specialist & RLHF Analyst at xAI, I conducted RLHF to optimize Large Language Models, ensuring high accuracy in instruction following. I specialized in verifying multi-step logic puzzles and arithmetic chains, applying Chain-of-Thought methodologies and code literacy checks. The work involved fact-checking, code review, and logic evaluation of LLM outputs for quality control. • Optimized LLMs' instruction following using RLHF iterative cycles. • Evaluated code snippets and web content for factual accuracy and logic flow. • Applied structured reasoning workflows, using Excel for AI-generated calculation verification. • Conducted real-time web retrieval assessments tailored to domain-specific queries.