LLM Evaluator & Red Team Prompt Contributor – Twitter AI (X)
Contributed to Twitter AI (X) by crafting adversarial prompts, performing model evaluations, and suggesting completions to enhance LLM robustness. Conducted systematic adversarial and edge-case testing to reduce hallucinations and improve model alignment. Provided qualitative and quantitative ratings on model-generated outputs.
• Wrote complex, domain-specific evaluation prompts for red-teaming.
• Rated LLM outputs against specified guidelines.
• Identified and documented common model failures and edge cases.
• Advised on model alignment improvements for future data labeling cycles.