Whiteboard (Technical Reasoning & STEM)
I serve as a Technical SME for Project Whiteboard, focusing on the rigorous evaluation of AI responses to complex mathematical and logical prompts. My role involved auditing the step-by-step reasoning (Chain-of-Thought) of the model to ensure technical accuracy and clarity. I provided detailed feedback and RLHF (Reinforcement Learning from Human Feedback) to refine the model's ability to solve multi-stage STEM problems without logical fallacies.