AI data annotation & LLM evaluation specialist
Evaluated and rated LLM generated responses for quality, accuracy, and guideline compliance. created and tested prompts , performed red teaming scenarios, and identified hallucinations and reasoning errors to improve AI performance.