AI Tools Researcher & Independent Tester
I tested and evaluated responses from AI language models such as ChatGPT, Claude, Grok, and Copilot for accuracy, clarity, safety, and reasoning. I performed RLHF, content annotation, and systematic text generation evaluation across numerous tasks and subject domains. I demonstrated strong instincts for identifying high-quality or flawed AI outputs directly relevant to AI training processes. • Analyzed hallucinations and factual inconsistencies. • Conducted preference data collection and response annotation. • Completed OpenTrain AI onboarding and annotation interview stages. • Utilized multiple AI training and annotation tools (Label Studio, Remotasks, Appen, OpenTrain AI, Scale AI).