ICT Officer & AI Evaluation Specialist
Evaluated and ranked outputs from Large Language Models (LLMs) across multiple modalities including text, image, audio, and video for research and improvement purposes. Provided qualitative feedback to enhance model reasoning, relevance, and accuracy following structured rubrics and guidelines. Collaborated closely with AI research teams remotely to boost model reliability and ensure dataset quality. • Assessed outputs in various modalities. • Followed strict evaluation guidelines and rubrics. • Maintained consistent accuracy while handling high volumes. • Worked asynchronously with research teams.