Rajat Shaily Sharma

Generalist AI Trainer

Glasgow, United Kingdom
$17.50/hr, Entry Level, Other

Key Skills

Software

Other

Top Subject Matter

Financial Compliance & Risk Analysis
Business Domain Expertise
Economics Domain Expertise

Top Data Types

Text

Top Task Types

RLHF

Freelancer Overview

Generalist AI Trainer with professional experience across complex workflows, research, and quality-focused execution. Education includes a Master of Science in Investment Banking and Finance from the University of Glasgow (2026) and a Bachelor of Business Administration from the University of Petroleum and Energy Studies (2023). AI-training focus covers text data and RLHF labeling workflows.

Entry Level, English, Hindi

Labeling Experience

Generalist AI Trainer

Other, Text, RLHF

As a Generalist AI Trainer, I evaluated and ranked AI-generated responses in various domains including finance, business, economics, and general knowledge. I wrote and refined prompts to test and improve model reasoning, and provided detailed annotations and rationales to inform reinforcement learning from human feedback (RLHF) pipelines. My work specifically leveraged domain expertise in finance and M&A to ensure technical accuracy in specialist outputs.

• Evaluated and ranked outputs of large language models for accuracy, coherence, and helpfulness.
• Crafted and edited prompts to probe model capabilities and surface edge-case behaviors.
• Provided detailed preference annotations and written rationales for RLHF processes.
• Applied subject matter expertise in finance to assess and guide specialist model outputs.


2025 - 2026

Education

University of Glasgow

Master of Science, Investment Banking and Finance

2024 - 2026

University of Petroleum and Energy Studies

Bachelor of Business Administration, Oil and Gas Marketing

2020 - 2023

Work History

Federation of Indian Petroleum Industry

VP Finance

Dehradun
2020 - 2023

Coral Research Services

Business Analyst

New Delhi
2022 - 2022