AI Trainer – Advanced LLM Evaluation Projects
As an AI Trainer at RWS, I contributed to advanced LLM evaluation projects involving reasoning assessment and dataset refinement. My main task was reviewing and optimizing English and multilingual AI responses for tone, grounding, and logical structure. The focus included chatbot consistency and accuracy benchmarking in multiple projects.
• Evaluated Diamond (Multimango & Parimango) and Ruby projects.
• Reviewed model outputs for factual correctness and coherence.
• Refined datasets through complex reasoning evaluation.
• Enhanced response consistency in English and multilingual settings.