Mary N

AI Training & Evaluation Specialist (Freelance)

Nairobi, Kenya
$8.00/hr, Intermediate, Other

Key Skills

Software

Other

Top Subject Matter

Large Language Models
AI Evaluation
Machine Learning

Top Data Types

Image
Video
Text

Top Label Types

Segmentation
Object Detection
Action Recognition
Evaluation Rating
Transcription
Classification
Question Answering
Emotion Recognition
RLHF
Prompt Response Writing SFT

Freelancer Overview

AI Training & Evaluation Specialist (Freelance) with 5+ years of professional experience across complex workflows, research, and quality-focused execution. Core strengths include Other. Education includes Master of Science, Technical University of Munich (2024) and Master of Science, University of Nairobi (2022). AI-training focus includes data types such as Text and labeling workflows including Evaluation Rating and Classification.

English (Intermediate)

Labeling Experience

AI Training & Evaluation Specialist (Freelance)

Other, Text
As an AI Training & Evaluation Specialist (Freelance) at Turing, I evaluated and compared AI model responses for reasoning quality and factual accuracy. I provided structured feedback to improve dataset quality and supported claim verification and hallucination detection for LLMs. My work contributed to building high-accuracy datasets for AI training and evaluation.
• Compared and rated AI-generated text responses for quality and factual accuracy.
• Performed hallucination detection and claim verification tasks on LLM outputs.
• Provided subjective and guideline-based assessments for dataset improvement.
• Supported ongoing enhancement of large language model reliability.

2025 - Present

Video Annotation

Other, Video, Segmentation, Object Detection
This involved reviewing video footage frame by frame and accurately labeling objects, actions, or events according to detailed project guidelines. I tracked moving objects across multiple frames, applied bounding boxes or tags consistently over time, and classified scenes or behaviors to help improve the model’s ability to detect patterns in dynamic environments.

2025

Train AI

Other, Image, Segmentation
As part of my role with RWS on Train AI – Project Diamond, I performed image annotation to support computer vision model training. This involved carefully reviewing images and labeling specific objects, text, or visual elements according to detailed annotation guidelines. I identified and tagged items within images, categorized them accurately, and ensured consistency in labeling to help the AI model correctly recognize patterns and visual features. In some tasks, I drew bounding boxes around objects, classified images into predefined categories, or verified whether annotations met quality standards.

2025

E-commerce Annotation

Other, Text, Classification, Question Answering
My role included reviewing AI-generated responses to customer inquiries, evaluating them for accuracy, helpfulness, tone, and policy compliance, and ranking or correcting responses based on established guidelines. I ensured that replies addressed customer intent clearly, maintained professionalism, and aligned with brand and safety standards. In addition, I identified intent categories such as order tracking, refunds, product inquiries, delivery timelines, and returns, tagging conversations accordingly to improve intent recognition models. The task required strong comprehension skills, contextual reasoning, and attention to nuance to ensure responses were customer-centered and solution-oriented. My contributions supported the refinement of conversational AI systems to enhance response relevance, customer satisfaction, and overall chatbot performance.

2025

AI Data Annotator (Freelance)

Other, Text, Classification
As an AI Data Annotator (Freelance) at RWS Moravia, I annotated and validated large-scale datasets for machine learning model training. I followed complex linguistic and technical guidelines across various data annotation tasks. My responsibilities included resolving ambiguous cases and supporting multilingual labeling workflows.
• Applied detailed labeling instructions for textual data used in AI training.
• Conducted data validation and quality checks for diverse datasets.
• Ensured high labeling accuracy through guideline interpretation.
• Participated in global AI data labeling projects involving structured and unstructured data.

2024 - 2025

Education

University of Nairobi

Master of Science, Project Planning and Management (Environmental Studies)

Master of Science
2023 - 2025

Technical University of Munich

Master of Science, Product Development

Master of Science
2024

Work History

RWS Remote

Data Specialist (Freelance)

Nairobi
2024 - Present

Momentum

Data Analyst

Nairobi
2021 - 2024