AI Response Evaluator & Prompt Engineer | Outlier | 2024 – Present
Remote | Multiple concurrent projects | Task management: Multimango | Time tracking: Hubstaff
• Evaluate pairs of AI-generated responses side by side across five structured quality dimensions, writing detailed preference justifications that identify specific strengths and weaknesses in each response and cite concrete examples rather than general observations
• Review and correct model-generated code for logical errors, runtime failures, and edge-case handling across varied programming languages and difficulty levels, tracing code line by line to confirm correct output
• Write high-level computer science prompts across multiple subdomains and difficulty tiers for the Millennium Leaf project, calibrating ambiguity, constraints, domain-knowledge requirements, and program scope to task specifications
• Evaluate text-to-image model outputs on the Aether project for prompt alignment, visual quality, and comparative ranking, assessing multiple generation metrics per task
• Apply specialist knowledge of software engineering, ML/AI systems, and computer vision to assess technically complex responses that require domain expertise beyond general annotation capability
Stack: Python, JavaScript, TypeScript | Domains: software engineering, ML/AI systems, computer vision, data structures and algorithms | Tools: Outlier, Multimango, Hubstaff