AI Trainer, Alignerr (Anthropic)
As an AI Trainer for Alignerr, I contributed to the training of AI models, working primarily in Python. My role centered on providing human preference feedback for reinforcement learning from human feedback (RLHF) tasks, with the goal of improving the effectiveness and safety of Anthropic's language models.
• Performed pairwise comparisons and ratings of AI-generated code outputs.
• Used internal and proprietary annotation platforms designed for AI training tasks.
• Refined complex prompts and model responses.
• Collaborated with domain experts to optimize labeling processes.