AI Content Generalist | Aether Project (Outlier)
This role involved assessing, evaluating, and providing structured feedback on Large Language Model (LLM) outputs. Responsibilities included adversarial prompt writing and quantitative as well as qualitative evaluation for AI language systems. The position prioritized rigorous verification of outputs and compliance with evolving task taxonomies, all within high-volume, quality-critical workflows. • Model responses were checked against reliable sources for factual accuracy. • Developed and tested intricate multi-turn prompts to challenge AI guardrails. • Delivered detailed qualitative ratings on model logic and conversational ability. • Maintained strict adherence to project quality and compliance guidelines.