AI Tutor & Data Generalist
As an AI Tutor & Data Generalist at Aether, I assessed and rated AI-generated responses for natural language understanding and reasoning quality. My responsibilities included prompt engineering, creating adversarial test cases, and ranking model outputs. I collaborated with the AI team to improve factual accuracy, logic, and adherence to safety guidelines.
• Evaluated model responses using established factual and safety criteria.
• Developed and tested challenging input prompts to identify model limitations.
• Provided structured feedback on model performance and reasoning.
• Participated in tuning model instruction-following and general knowledge performance.