George

AI Domain Expert & LLM Evaluator

Juja, Kenya
$20.00/hr, Expert, Mercor, CVAT, Crowdsource

Key Skills

Software

Mercor
CVAT
CrowdSource
Data Annotation Tech
Google Cloud Vertex AI
Remotasks

Top Subject Matter

Technical STEM (mathematics, engineering)
Urban traffic surveillance
Safety analytics

Top Data Types

Text
Video
Image

Top Task Types

Segmentation
RLHF
Bounding Box
Object Detection
Text Generation
Transcription
Computer Programming/Coding

Freelancer Overview

AI Domain Expert & LLM Evaluator with 7+ years of professional experience spanning research and quality-focused execution of complex workflows. Core strengths include internal and proprietary tooling and video annotation platforms. Education: Bachelor of Science, Jomo Kenyatta University of Agriculture and Technology (2024). AI-training focus covers text and video data, with labeling workflows including evaluation, rating, and segmentation.

Expert, English, Swahili

Labeling Experience

LLM Red Teamer & RLHF Evaluator

Text, RLHF
I performed reinforcement learning from human feedback (RLHF) tasks to optimize the reasoning capabilities and user safety of language models. My contributions included red teaming and stress testing models to identify and address logical inconsistencies. This process involved the design and critical evaluation of complex prompts to push model limits.
• Designed and executed prompt-based stress tests for LLM safety.
• Identified logical and factual weaknesses in LLM outputs.
• Collaborated with teams to enhance model robustness against adversarial cases.
• Delivered actionable insights for iterative model improvement.


2026 - Present

Video Annotation Specialist (Urban Traffic)

Video, Segmentation
I designed and implemented complex annotation schemas for urban traffic surveillance video data. My work focused on translating raw footage into detailed, structured datasets for computer vision model training. High-volume, high-precision video segmentation and schema adherence were essential to the project's success.
• Developed segmentation and object tracking schemas for traffic flow analysis.
• Labeled and reviewed batches for high inter-annotator agreement and dataset integrity.
• Collaborated on schema validation for edge-case detection (e.g., unusual traffic events).
• Utilized advanced video annotation platforms to streamline data labeling workflows.


2026 - Present

AI Domain Expert & LLM Evaluator

Text
I conducted rigorous evaluations of Large Language Models, with a focus on technical accuracy in STEM-related content. My responsibilities included ranking and grading LLM outputs for factual correctness, depth of reasoning, and safety using RLHF methodologies. I created structured feedback to improve model responses and inform tuning cycles.
• Led the assessment of AI-generated responses in mathematics and engineering topics.
• Applied advanced prompt engineering to test and validate model boundaries.
• Ensured data integrity and high inter-annotator agreement in LLM batches.
• Utilized LLM sandboxes and evaluation platforms extensively.


2026 - Present

STEM Response Verification Expert

Text
I served as a subject matter expert to audit and correct AI-generated solutions in mathematics and engineering. The work ensured that technical training data met professional standards through precise review and validation. My responsibilities included batch corrections, error flagging, and standards compliance for LLM datasets.
• Reviewed and corrected technical AI-generated outputs for accuracy.
• Flagged and documented common LLM reasoning errors in STEM topics.
• Delivered high-quality audits with detailed recommendations for data improvement.
• Contributed to the development of quality benchmarks for complex problems.


2025 - Present

Urban Traffic Surveillance Schema Annotator

Video, Segmentation
I contributed to a high-granularity video annotation project for urban traffic flow and safety AI models. My responsibilities included segmenting video sequences and labeling key features relevant to machine learning development. The outcome ensured that datasets were comprehensive for subsequent model training and validation.
• Collaborated on schema definition and review for video datasets.
• Provided quality assurance for annotation consistency and coverage.
• Liaised with engineering and modeling teams on labeling requirements.
• Assisted in integrating annotated data into AI training pipelines.


2025 - Present

Education

Kenswed College

Certification in PHP, Python & MySQL, Programming

2023 - 2024

Jomo Kenyatta University of Agriculture and Technology

Bachelor of Science, Mechatronic Engineering

2024

Work History

Jomo Kenyatta University of Agriculture and Technology

Engineering Student Assistant

Juja
2024 - Present

Freelance

Transcription Specialist

Nairobi
2020 - Present