AI Model Evaluator – Outlier AI
As an AI Model Evaluator at Outlier AI, I supported the evaluation and performance analysis of large language models (LLMs) and multimodal AI systems. My work focused on assessing AI outputs, identifying errors, and validating output consistency across custom image and video datasets. This included applying structured rubrics, testing transformation functions, and validating mathematical and visual reasoning; representative sketches of the rubric and transformation-testing work follow the list below.

• Analysed AI-generated outputs for accuracy, consistency, and errors.
• Performed structured testing on image and video data using Python-based analysis tools.
• Designed and applied evaluation rubrics to classify responses and flag inconsistencies.
• Identified edge cases in multimodal model performance and documented findings.
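
To illustrate the rubric work, here is a minimal sketch of rubric-based response classification. The three criteria, the 0–2 scale, and the Verdict labels are illustrative assumptions for this sketch, not Outlier AI's actual rubric.

```python
# Minimal sketch of rubric-based response classification.
# Criteria, the 0-2 scale, and verdict labels are illustrative assumptions.
from dataclasses import dataclass
from enum import Enum


class Verdict(Enum):
    PASS = "pass"
    MINOR_ISSUE = "minor_issue"
    FAIL = "fail"


@dataclass
class RubricScore:
    accuracy: int     # 0-2: factual correctness of the response
    consistency: int  # 0-2: agreement with prior outputs on the same input
    formatting: int   # 0-2: adherence to the requested output format


def classify(score: RubricScore) -> Verdict:
    """Map rubric scores to a verdict: any zero fails, any one is a minor issue."""
    lowest = min(score.accuracy, score.consistency, score.formatting)
    if lowest == 0:
        return Verdict.FAIL
    if lowest == 1:
        return Verdict.MINOR_ISSUE
    return Verdict.PASS


if __name__ == "__main__":
    print(classify(RubricScore(accuracy=2, consistency=1, formatting=2)))
    # -> Verdict.MINOR_ISSUE
```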
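And a minimal sketch of transformation-function testing on image data, assuming images are handled as NumPy arrays; the horizontal-flip involution check is a hypothetical stand-in for the kind of consistency validation performed, not a specific test case from the role.

```python
# Minimal sketch of transformation-function consistency testing.
# Assumes images are NumPy arrays; the flip check is a hypothetical example.
import numpy as np


def check_involution(image: np.ndarray, transform, atol: float = 1e-6) -> bool:
    """Verify that applying a self-inverse transform twice restores the image."""
    restored = transform(transform(image))
    return np.allclose(image, restored, atol=atol)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    img = rng.random((64, 64, 3))  # synthetic RGB image stand-in
    # A horizontal flip is its own inverse, so this check should pass.
    print(check_involution(img, lambda a: a[:, ::-1, :]))  # -> True
```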