Vladyslav Leliukh

LLM Training & RLHF Subject Matter Expert

Edinburgh, Scotland
$14.00/hr, Expert, Labelbox, Scale AI, Remotasks

Key Skills

Software

Labelbox
Scale AI
Remotasks

Top Subject Matter

Large Language Model Training for Marketing, Brand Voice, and Business Logic

Top Data Types

Text

Top Task Types

RLHF

Freelancer Overview

LLM Training & RLHF Subject Matter Expert (Ocado Technology). Brings 9+ years of professional experience across complex workflows, research, and quality-focused execution. Core strengths include internal and proprietary tooling. Education: Bachelor of Arts, University of Westminster (2018). AI-training focus covers text data and RLHF labeling workflows.

Expert
English, Polish, Russian, Ukrainian

Labeling Experience

LLM Training & RLHF Subject Matter Expert (Ocado Technology)

Text, RLHF
Provided high-quality human feedback for Large Language Model (LLM) training focused on optimizing reasoning, factual accuracy, and brand-voice alignment. Led AI operations that structured instructional inputs and review data for advanced reinforcement learning systems. Utilized prompt engineering and supervised fine-tuning methods to directly improve model outputs for business-focused use cases.

• Designed and executed data annotation workflows for business logic alignment.
• Conducted thorough evaluation and rating of LLM-generated responses.
• Applied LLM evaluation, fact-checking, and prompt engineering for AI safety and hallucination mitigation.
• Integrated feedback cycles using internal/proprietary tooling and Python-based review scripts.

2022 - 2026

Education

University of Westminster

Bachelor of Arts, Business Management
2015 - 2018

Work History

Ocado Technology

Growth Operations & AI Lead

London
2022 - Present

Connect-Riviera

Digital Marketing Project Specialist

Nice
2018 - 2022