Yaroslav Shyryayev

AI Red Teamer - AI Safety & RLHF

Sassari, Italy
$45.00/hr, Expert, Appen, Labelbox, Mercor

Key Skills

Software

Appen
Labelbox
Mercor
OneForma
Scale AI
SuperAnnotate

Top Subject Matter

No subject matter listed

Top Data Types

Audio
Computer Code Programming
Text

Top Label Types

RLHF
Red Teaming
Evaluation Rating
Prompt Response Writing SFT
Entity NER Classification

Freelancer Overview

I am a multilingual AI Data Specialist with extensive experience in data labeling, annotation, and AI training data for leading platforms such as Outlier, Alignerr, Appen, and OneForma. My work has focused on evaluating and refining Large Language Models (LLMs) through Reinforcement Learning from Human Feedback (RLHF), prompt engineering, red teaming, and fact-checking, ensuring high-quality, safe, and aligned AI outputs. I am skilled in generating and annotating complex prompts across Italian, Russian, and English, and have a strong background in both NLP and medical data workflows, including digital transformation using CAD/CAM and 3D printing technologies. My technical toolkit includes SQL, advanced Excel, ChatGPT, Claude, Copilot, and various annotation interfaces. I am highly organized, detail-oriented, and certified in advanced AI prompting, data analysis, and agile project management, enabling me to deliver accurate, reliable, and efficient training data for cutting-edge AI applications.

Expert: English, Italian, Russian

Labeling Experience

SuperAnnotate

AI Red Teaming & Safety Specialist

SuperAnnotate, Text, RLHF, Red Teaming
Adversarial testing and safety alignment for Large Language Models, specifically tailored for the Italian language market. Designing complex "jailbreak" prompts to bypass safety filters, identifying biases, and testing model boundaries regarding prohibited content, ethical guidelines, and security. Daily high-intensity testing sessions targeting various safety taxonomies. Adherence to strict "Harmlessness" policies and daily calibration with the safety team to ensure zero tolerance for unsafe model outputs.

2025 - 2024
Labelbox

LLM Evaluation

Labelbox, Text, RLHF
Improving model reasoning and helpfulness through Reinforcement Learning from Human Feedback (RLHF). Crafting complex prompts and performing side-by-side (Model A vs Model B) evaluations. Ranking responses based on accuracy, tone, and formatting. Maintained a high Quality Score (QA) through rigorous fact-checking and adherence to project-specific style guides.

2024 - 2025
Appen

Complex Audio Transcription & Sound Identification

Appen, Audio, Entity NER Classification
Developing datasets for advanced speech recognition and sound detection. Transcribing audio files verbatim and inserting specific tags (entity classification) for non-verbal sounds (sighs, pauses, background noise). Processed hundreds of hours of multi-speaker audio. Adherence to strict transcription conventions and time-stamping accuracy within millisecond thresholds.

2024 - 2024
Appen

LLM Training & Comparative Analysis

Appen, Text, Prompt Response Writing SFT
Training Generative AI models to provide better creative and logical responses. Writing original prompts to trigger specific model behaviors and evaluating the quality of two competing model outputs.

2024 - 2024
OneForma

Advanced LLM Response Evaluation (RLHF)

OneForma, Text, Evaluation Rating
Improving model reasoning and helpfulness through Reinforcement Learning from Human Feedback (RLHF).

2024 - 2024

Education


Coursera / Scrimba

Specialization Certificate, Software Development with AI Coding Assistants

2026 - 2026

Coursera / Google

Specialization Certificate, Agile Project Management

2026 - 2026

Work History


Private Dental Practice

Dental Studio Administrator & Lab Manager

Sassari
2016 - 2023