For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
B

Brian Ndege

Senior AI Evaluation & QA Specialist

KENYA flag
Nairobi, Kenya
$20.00/hrExpertOther

Key Skills

Software

Other

Top Subject Matter

LLM Evaluation
AI Agent QA
Workflow Simulation

Top Data Types

TextText

Top Task Types

Entity Ner Classification

Freelancer Overview

Senior AI Evaluation & QA Specialist. Brings 2+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal, Proprietary Tooling, and Other. Education includes Master of Science, University of Nairobi (2022) and Bachelor of Science, Moi University (2021). AI-training focus includes data types such as Text and labeling workflows including Evaluation, Rating, and Entity (NER) Classification.

ExpertSwahiliEnglish

Labeling Experience

Senior AI Evaluation & QA Specialist

Text
In this role, I created structured evaluation scenarios to test large language model (LLM) agents in simulating real-world workflows. I established gold-standard outputs and acceptable response ranges to assess model performance and consistency. I also developed and maintained scenario templates using JSON and YAML for enhanced QA coverage. • Led scenario-based QA reviews identifying logical inconsistencies. • Implemented validation frameworks to boost model reliability. • Used tools such as Postman, Jira, and TestRail for workflow and evaluation. • Collaborated with AI developers to improve evaluation frameworks.

In this role, I created structured evaluation scenarios to test large language model (LLM) agents in simulating real-world workflows. I established gold-standard outputs and acceptable response ranges to assess model performance and consistency. I also developed and maintained scenario templates using JSON and YAML for enhanced QA coverage. • Led scenario-based QA reviews identifying logical inconsistencies. • Implemented validation frameworks to boost model reliability. • Used tools such as Postman, Jira, and TestRail for workflow and evaluation. • Collaborated with AI developers to improve evaluation frameworks.

2023 - Present

AI Data Analyst & QA Engineer

OtherTextEntity Ner Classification
I performed natural language processing (NLP) annotation and dataset validation for machine learning use cases. The role involved designing test cases for chatbots and automated evaluation pipelines to check AI output accuracy. I also developed scoring rubrics and documented edge cases for QA. • Annotated NLP datasets for model training and validation. • Built Python scripts for automated output testing. • Worked with Jupyter Notebook, VS Code, and MongoDB. • Focused on conversational AI and chatbot datasets.

I performed natural language processing (NLP) annotation and dataset validation for machine learning use cases. The role involved designing test cases for chatbots and automated evaluation pipelines to check AI output accuracy. I also developed scoring rubrics and documented edge cases for QA. • Annotated NLP datasets for model training and validation. • Built Python scripts for automated output testing. • Worked with Jupyter Notebook, VS Code, and MongoDB. • Focused on conversational AI and chatbot datasets.

2020 - 2022

Education

M

Moi University

Bachelor of Science, Computer Science

Bachelor of Science
2018 - 2021
U

University of Nairobi

Master of Science, Information Technology

Master of Science
2022

Work History

T

TechSavanna Solutions Ltd

Senior AI Evaluation & QA Specialist

Nairobi
2023 - Present
U

Uasin Gishu Digital Systems

Junior Software Tester

Eldoret
2019 - 2020