For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
G

Gagandeep Singh

AI Feedback Aligner (Freelance) | Anthropic

Canada flagBrampton, Canada
$25.00/hrExpertSuperannotateLabelboxMercor

Key Skills

Software

SuperAnnotateSuperAnnotate
LabelboxLabelbox
MercorMercor

Top Subject Matter

AI Model Training and Code Evaluation
Computer Programming
Technical Writing

Top Data Types

Computer Code ProgrammingComputer Code Programming
TextText
DocumentDocument

Top Task Types

RLHF
Computer Programming Coding
Evaluation Rating
Function Calling
Fine Tuning
Transcription
Data Collection
Text Generation
Object Detection
Prompt Response Writing SFT
Question Answering

Freelancer Overview

I have been working directly with Anthropic through Aligner as an AI feedback contractor since late 2024 where I review and evaluate Claude generated code outputs for correctness efficiency and security. This work feeds directly into reinforcement learning from AI feedback so I understand how training data quality impacts model behavior at a fundamental level. Before that I spent years at Vosyn AI building speech to speech pipelines and RAG systems using AWS SageMaker Bedrock and LLM APIs which gave me a strong technical foundation for understanding what good AI output actually looks like.

ExpertEnglish

Labeling Experience

AI Feedback Aligner (Freelance) | Anthropic

OtherRLHF
Provided detailed feedback on AI-generated code outputs to improve model performance through reinforcement learning from AI feedback. Evaluated outputs for correctness, efficiency, and security in alignment with best practices. Cleared and refined AI-assisted code contributions to ensure they met enterprise standards. • Focused on both technical and quality standards. • Regularly suggested improvements for better outcomes. • Contributed to enhancing Claude model reliability and alignment. • Ensured outputs were ready for production use.

Provided detailed feedback on AI-generated code outputs to improve model performance through reinforcement learning from AI feedback. Evaluated outputs for correctness, efficiency, and security in alignment with best practices. Cleared and refined AI-assisted code contributions to ensure they met enterprise standards. • Focused on both technical and quality standards. • Regularly suggested improvements for better outcomes. • Contributed to enhancing Claude model reliability and alignment. • Ensured outputs were ready for production use.

2025 - Present

Education

S

Seneca College

Diploma, Building Systems Engineering

Diploma
2025 - 2025
W

Western Governors University

Bachelor of Science, Data Analytics

Bachelor of Science
2025 - 2025

Work History

B

Branchy Solution

Data Engineer

Brampton
2025 - Present
V

Vosyn AI

Software Engineer

Brampton
2019 - 2024