AI Model Evaluator – Benchmark Project
As an AI Model Evaluator on the Benchmark Project at Turing, I developed ideal reference solutions for tasks used in frontier AI model evaluations. My responsibilities included systematically assessing outputs from frontier systems such as OpenAI's GPT models, Anthropic's Claude, and Google's Gemini for correctness, reasoning quality, and instruction following, and providing structured, actionable feedback to improve model quality.
• Evaluated AI model responses for accuracy and completeness.
• Benchmarked frontier language models on reasoning and adherence to prompts.
• Developed reference task solutions and rubric-based assessment frameworks (a minimal sketch follows this list).
• Identified model weaknesses and hallucinations.
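To illustrate the kind of rubric-based scoring this work involved, here is a minimal Python sketch. The criterion names, weights, model name, and task id are hypothetical placeholders introduced for illustration only; they are not the Benchmark Project's actual schema or tooling.

```python
from dataclasses import dataclass, field

# Hypothetical, simplified rubric: criteria and weights are illustrative.
RUBRIC = {
    "correctness": 0.5,            # agreement with the reference solution
    "reasoning": 0.3,              # sound, step-by-step justification
    "instruction_following": 0.2,  # honors all prompt constraints
}

@dataclass
class Evaluation:
    """One graded model response, with per-criterion scores in [0, 1]."""
    model: str
    task_id: str
    scores: dict = field(default_factory=dict)
    notes: str = ""

    def weighted_score(self) -> float:
        # Weighted sum over rubric criteria; missing criteria count as 0.
        return sum(w * self.scores.get(c, 0.0) for c, w in RUBRIC.items())

def flag_weaknesses(ev: Evaluation, threshold: float = 0.5) -> list[str]:
    """Return criteria scored below the threshold, e.g. to surface
    hallucination-prone or instruction-violating responses for feedback."""
    return [c for c in RUBRIC if ev.scores.get(c, 0.0) < threshold]

if __name__ == "__main__":
    ev = Evaluation(
        model="frontier-model-a",   # placeholder model name
        task_id="math-047",         # placeholder task id
        scores={"correctness": 1.0, "reasoning": 0.8,
                "instruction_following": 0.4},
        notes="Correct answer, but ignored the required output format.",
    )
    print(f"{ev.task_id}: {ev.weighted_score():.2f}")  # -> math-047: 0.82
    print("weak criteria:", flag_weaknesses(ev))       # -> ['instruction_following']
```

Weighting correctness most heavily mirrors the emphasis on comparison against ideal reference solutions, while the flagging helper reflects how weak reasoning or instruction-following could be surfaced as structured feedback.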