AI Developer Task Designer
Contributed to a data creation project aimed at improving how frontier LLMs solve realistic, developer-style tasks in terminal-based environments. The project focused on enabling AI systems to reason, plan, and execute solutions the way real software engineers do. My role involved designing realistic problem scenarios that reflect actual developer workflows, defining clear task requirements, and validating task difficulty and feasibility. I also participated in quality reviews to ensure tasks accurately tested instruction following, technical reasoning, and tool usage in command-line environments. This work directly supported the creation of high-quality training and evaluation data for advanced AI agent development.