For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
M
muhammed abdulshakur

muhammed abdulshakur

AI Model Evaluator – Benchmark Project

Nigeria flagAbuja, Nigeria
$15.00/hrEntry LevelOther

Key Skills

Software

Other

Top Subject Matter

Frontier AI language models
Computer vision / video datasets

Top Data Types

TextText
VideoVideo
ImageImage

Top Task Types

Object DetectionObject Detection
SegmentationSegmentation
ClassificationClassification
Entity (NER) ClassificationEntity (NER) Classification
Text GenerationText Generation
Question AnsweringQuestion Answering
Fine-tuningFine-tuning
Text SummarizationText Summarization
Evaluation/RatingEvaluation/Rating
Computer Programming/CodingComputer Programming/Coding
Data CollectionData Collection
Function CallingFunction Calling

Freelancer Overview

Detail-oriented AI Data Annotator and Model Evaluator with experience working on AI training and benchmarking projects with Turing. Skilled in video data annotation, AI response evaluation, and benchmarking frontier language models including OpenAI, Claude, and Gemini. Experienced in developing ideal task solutions, assessing model outputs for reasoning quality and correctness, and maintaining high-quality datasets for machine learning training. Education includes Bachelor of Engineering, Federal University of Technology Minna (2018).

Entry LevelEnglish

Labeling Experience

AI Model Evaluator – Benchmark Project

OtherText
As an AI Model Evaluator on the Benchmark Project at Turing, I developed ideal reference solutions for tasks used in frontier AI model evaluations. My responsibilities included systematically assessing model outputs from systems such as OpenAI, Claude, and Gemini for correctness, reasoning, and instruction following. I provided comprehensive feedback to enhance the quality of AI systems through structured analysis. • Evaluated AI model responses for accuracy and completeness. • Benchmarked frontier language models for reasoning and adherence to prompts. • Developed task solutions and assessment frameworks. • Identified model weaknesses and hallucinations.

As an AI Model Evaluator on the Benchmark Project at Turing, I developed ideal reference solutions for tasks used in frontier AI model evaluations. My responsibilities included systematically assessing model outputs from systems such as OpenAI, Claude, and Gemini for correctness, reasoning, and instruction following. I provided comprehensive feedback to enhance the quality of AI systems through structured analysis. • Evaluated AI model responses for accuracy and completeness. • Benchmarked frontier language models for reasoning and adherence to prompts. • Developed task solutions and assessment frameworks. • Identified model weaknesses and hallucinations.

2026 - 2026

AI Data Annotator – Video Labeling

OtherVideoObject Detection
As an AI Data Annotator specializing in video labeling at Turing, I contributed to machine learning projects by meticulously annotating video datasets. I identified and labeled objects, actions, and contextual elements across video frames to support computer vision training. Ensuring accuracy, I followed specific guidelines to deliver high-quality data for AI development. • Annotated and labeled video frames for object and action recognition. • Maintained detailed adherence to annotation protocols and accuracy standards. • Supported dataset creation for machine learning models. • Enhanced data quality for computer vision model training.

As an AI Data Annotator specializing in video labeling at Turing, I contributed to machine learning projects by meticulously annotating video datasets. I identified and labeled objects, actions, and contextual elements across video frames to support computer vision training. Ensuring accuracy, I followed specific guidelines to deliver high-quality data for AI development. • Annotated and labeled video frames for object and action recognition. • Maintained detailed adherence to annotation protocols and accuracy standards. • Supported dataset creation for machine learning models. • Enhanced data quality for computer vision model training.

2025 - 2025

Education

F

Federal University of Technology Minna

Bachelor of Engineering, Chemical Engineering

Bachelor of Engineering
2018 - 2018

Work History

T

Township Tech Development Foundation

Lead Instructor

Ilesha
2023 - Present
F

Functionstack

Data Engineer & Backend Developer

London
2023 - Present