AI Trainer
Scope: dvanced reasoning domains: Mathematics, scientific analysis, legal reasoning, software engineering, and other technical fields requiring deep expertise Model alignment work: Training frontier AI models to be more helpful, accurate, and safe through reinforcement learning from human feedback (RLHF) Safety and quality assurance: Identifying hallucinations, biases, and potential risks in model outputs Multi-step problem solving: Tasks requiring chain-of-thought evaluation and verification across complex workflows Tasks: Evaluating and ranking model responses for technical accuracy and clarity Writing detailed, expert-level explanations for complex topics Red-teaming models to identify failure modes and safety concerns Annotating reasoning chains and identifying logical errors Creating high-quality prompt/response pairs for specialized domains Multi-turn conversation evaluation and refinement