Rahel Mariwan

AI Model Evaluator at Outlier AI

London, United Kingdom
$25.00/hr
Intermediate
Other

Key Skills

Software

Other

Top Subject Matter

AI Model Evaluation
Multimodal Systems
Machine Learning & Data Science

Top Data Types

Image
Text
Document

Top Task Types

Prompt + Response Writing (SFT)
Computer Programming/Coding
Transcription
RLHF
Object Detection

Freelancer Overview

I have over a year of hands-on experience in AI model evaluation and data labeling, working with Outlier AI on large-scale multimodal and LLM projects. In this role I evaluated AI-generated outputs across image, video, and text tasks, designed structured rubrics to classify model responses, validated mathematical reasoning, and identified edge cases and inconsistencies to improve model reliability. I also worked extensively on the Aether coder project, applying transformation functions and Python-based analysis to assess model behaviour at a technical level. Alongside this, I bring a strong engineering background from Rolls-Royce and Alpha Plus Technologies, where I worked with complex technical data, validation processes, and detailed documentation. These skills translate directly into rigorous and precise AI data annotation work. I am highly detail-oriented, comfortable working with ambiguous data, and experienced in applying systematic, structured approaches to evaluate and improve AI systems at scale.

Intermediate
English
Kurdish
Dutch

Labeling Experience

AI Model Evaluator – Outlier AI

Other
Image
As an AI Model Evaluator at Outlier AI, I supported the evaluation and performance analysis of large language models (LLMs) and multimodal AI systems. My work focused on assessing AI outputs, identifying errors, and validating output consistency across custom image and video datasets. This included applying structured rubrics, testing transformation functions, and validating mathematical and visual reasoning.
• Analysed AI-generated outputs for accuracy, consistency, and error detection.
• Performed structured testing with image and video data using Python-based analysis tools.
• Designed and applied evaluation rubrics to classify responses and inconsistencies.
• Identified edge cases in multimodal model performance and documented findings.

2026 - Present

Education

University of Warwick

Master of Engineering, Advanced Mechanical Engineering

2021 - 2022

Coventry University

Bachelor of Engineering, Mechanical Engineering

2017 - 2020

Work History

Dilniya

Founder & Developer

London
2026 - Present

Rolls-Royce

Mechanical Engineer

Derby
2024 - 2025