Sikandar Saleem

LLM AI Trainer (RLHF Evaluator & Prompt Engineer)

Remote, USA
Intermediate

Key Skills

Software

No software listed

Top Subject Matter

General Large Language Models (LLMs)
AI-generated content
prompt engineering

Top Data Types

Text
Document

Top Task Types

RLHF

Freelancer Overview

LLM AI Trainer (RLHF Evaluator & Prompt Engineer) with 7+ years of professional experience spanning complex workflows, research, and quality-focused execution. Core strengths include working with internal and proprietary tooling. Holds a Bachelor of Science from the University of Central Florida (2020). AI-training work focuses on text data and RLHF labeling workflows.

Labeling Experience

LLM AI Trainer (RLHF Evaluator & Prompt Engineer)

Text, RLHF
As a Senior Software Engineer, I participated directly in Large Language Model (LLM) fine-tuning and evaluation tasks. My responsibilities included reviewing LLM outputs, providing corrections, refining prompts, and curating datasets to improve response quality. I played a key role in aligning model responses with business requirements, using reinforcement learning from human feedback (RLHF) and prompt engineering.

• Evaluated LLM responses for relevance, coherence, and accuracy based on pre-defined metrics.
• Trained AI models by reviewing outputs and rewriting model answers to enhance clarity and correctness.
• Created and refined prompts and datasets to help the AI understand intent and context more accurately.
• Leveraged internal/proprietary tools for prompt + response writing, RLHF feedback, and dataset enhancements.

2024 - Present

Education

University of Central Florida

Bachelor of Science, Computer Science

2016 - 2020

Work History

Turing

Senior Software Engineer

Remote
2024 - Present

Tkxel

Senior Software Engineer

Remote
2023 - 2024