LLM Red Teamer & RLHF Evaluator
I performed reinforcement learning from human feedback (RLHF) evaluation tasks to improve the reasoning capabilities and safety of language models. My contributions included red teaming and stress testing models to surface and address logical inconsistencies, which involved designing and critically evaluating complex prompts that push models to their limits.
• Designed and executed prompt-based stress tests for LLM safety.
• Identified logical and factual weaknesses in LLM outputs.
• Collaborated with teams to strengthen model robustness against adversarial inputs.
• Delivered actionable insights for iterative model improvement.