Bryan Shin

LLM Evaluation Specialist

San Ramon, USA
$40.00/hr
Intermediate
Data Annotation Tech

Key Skills

Software

Data Annotation Tech

Top Subject Matter

AI Alignment and LLM Model Evaluation

Top Data Types

Text
Computer Code Programming
Image

Top Task Types

RLHF
Computer Programming/Coding
Evaluation/Rating

Freelancer Overview

LLM Evaluation Specialist with 2+ years of professional experience across complex professional workflows, research, and quality-focused execution. Holds a Bachelor of Science from the University of California, Davis (2024). AI-training focus covers data types such as computer code and programming, and labeling workflows such as evaluation and rating.

Intermediate
English
Korean

Labeling Experience

LLM Evaluation Specialist

Other
I evaluated and rated large language model (LLM) outputs on various coding scenarios. My work focused on correctness, instruction following, and safety assessments for reinforcement learning from human feedback and model alignment tasks. I engineered and utilized prompts to review, score, and provide evaluative feedback for model improvement cycles.
• Evaluated LLM performance on code generation and completion tasks.
• Provided multi-axis ratings for LLM responses based on established criteria.
• Designed task-specific prompts tailored to different programming problems.
• Supported RLHF and RLVR pipelines by contributing human evaluative judgments.

2024 - Present

Education

University of California, Davis

Bachelor of Science, Computer Science
2019 - 2024

Work History

Team Koprulu

Software Engineer

San Ramon
2025 - Present

EvoLadderBot

Software Engineer

San Ramon
2025 - Present