Geoffrey Audia

Freelance LLM Test Designer & QA Engineer – LLM/AI Model Evaluation & Testing

Nairobi, Kenya
$8.00/hr, Expert, Clickworker, Axiom AI, CrowdSource

Key Skills

Software

Clickworker
Axiom AI
CrowdSource
Data Annotation Tech

Top Subject Matter

Large Language Models
Conversational AI
AI Safety

Top Data Types

Text
Document
Video

Top Task Types

Classification
Polygon
Point / Key Point
Entity (NER) Classification
Cuboid

Freelancer Overview

Freelance LLM Test Designer & QA Engineer specializing in LLM/AI model evaluation and testing, with 14+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include internal and proprietary tooling. Holds a Bachelor of Science from the University of Nairobi (2012). AI-training focus covers text data and evaluation and rating workflows.

English: Expert

Labeling Experience

Freelance LLM Test Designer & QA Engineer – LLM/AI Model Evaluation & Testing

Text
As a Freelance LLM Test Designer & QA Engineer, I designed and executed data labeling tasks focused on evaluating large language model outputs. My work included authoring evaluation criteria, data-driven test cases, and structured quality assessments for LLM-generated content. I integrated automated evaluation pipelines, using JSON and YAML formats to manage configurations and results consistently.

• Designed test suites for assessing intent classification, slot filling, and conversational accuracy in LLM-powered chatbots.
• Performed adversarial prompt and prompt-injection testing for AI safety, using red-teaming methods on LLMs.
• Authored and validated model output data for instruction-following, coherence, and hallucination detection.
• Built CI-integrated frameworks to systematically rate model output quality and regression-test it over time.

2020 - Present

Education

University of Nairobi

Bachelor of Science, Computer Science

2012

Work History

Independent

Freelance LLM Test Designer & QA Engineer

Nairobi
2020 - Present

Safaricom

Senior QA Automation Engineer

Nairobi
2016 - 2020