AI Q/A and Red Teaming Analyst
As an AI Q/A and Red Teaming Analyst, I reviewed and refined outputs from large language models. My work focused on technical prompt accuracy, feedback generation, and error pattern discovery to enhance LLM training. This included structured evaluations, red teaming for model robustness, and jailbreaking tests.
• Utilized Labelbox to refine and rate LLM outputs
• Produced structured feedback and recurring error analyses
• Specialized in evaluation, classification, and complex Q/A annotation
• Improved training accuracy and robustness for NLP models