AI Trainer - LLM Evaluation
Contributing to a LLM rating project (Portuguese) on the SRT platform. I take conversations with AI agents in order to rank their responses based on key aspects (language match, presentation, grammar issues, fluency) and to tag issues related to false refusal and templated outputs.