For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
P

Patrick Anselmi Barbour

AI Alignment and RLHF Data Labeler/Evaluator

USA flagHuntington, Usa
Entry Level

Key Skills

Software

No software listed

Top Subject Matter

AI alignment
Cybersecurity Domain Expertise
multi-agent LLM evaluation

Top Data Types

TextText

Top Task Types

RLHFRLHF

Freelancer Overview

AI Alignment and RLHF Data Labeler/Evaluator. Brings 4+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Science, Unknown (2020). AI-training focus includes data types such as Text and labeling workflows including RLHF.

Entry Level

Labeling Experience

AI Alignment and RLHF Data Labeler/Evaluator

TextRLHF
Performed reinforcement learning from human feedback (RLHF) on large language models to enhance alignment and reasoning. Evaluated model outputs for quality, ethical adherence, and logical accuracy using adversarial red teaming and structured prompt evaluation. Delivered high-signal preference data and prompt/response reviews for training autonomous AI agents. • Conducted hands-on prompt processing and red team testing in multi-agent LLM environments. • Rated, evaluated, and provided feedback across a wide range of AI-generated outputs, including technical and cybersecurity content. • Executed persistent, asynchronous model evaluations for context retention and advanced reasoning tasks. • Leveraged specialized mobile-centric infrastructure to assess mobile-first AI agent deployment and security.

Performed reinforcement learning from human feedback (RLHF) on large language models to enhance alignment and reasoning. Evaluated model outputs for quality, ethical adherence, and logical accuracy using adversarial red teaming and structured prompt evaluation. Delivered high-signal preference data and prompt/response reviews for training autonomous AI agents. • Conducted hands-on prompt processing and red team testing in multi-agent LLM environments. • Rated, evaluated, and provided feedback across a wide range of AI-generated outputs, including technical and cybersecurity content. • Executed persistent, asynchronous model evaluations for context retention and advanced reasoning tasks. • Leveraged specialized mobile-centric infrastructure to assess mobile-first AI agent deployment and security.

Present

Education

U

Unknown

Bachelor of Science, Cybersecurity

Bachelor of Science
2020

Work History

A

AI Nexus

Architect & Developer

Huntington
2022 - 2023
H

Home Network Intelligence System

Network Architect

Huntington
2021 - 2022