INDEPENDENT CONTRACTOR
Worked on AI training tasks focused on improving model behavior through Reinforcement Learning from Human Feedback (RLHF) and API integration. My contributions included:
RLHF Evaluation: Assessed and ranked AI-generated responses on alignment with human values, helpfulness, clarity, and factual correctness. Helped shape reward models used to align large language models with real-world user expectations.
Multi-Turn Ranking: Evaluated complex, multi-turn conversations for consistency, coherence, and response quality across dialogue chains.
API Calling Tasks: Designed and tested prompt structures for tool-augmented tasks in which models interact with APIs. Ensured correct, context-aware API execution based on natural language input and task goals.
Focused on crafting realistic, challenging scenarios that pushed models to demonstrate reasoning, tool usage, and response accuracy in dynamic contexts.