Md Touhidul Islam

AI Trainer

Rajshahi, Bangladesh
Expert

Key Skills

Software

No software listed

Top Subject Matter

Large Language Models (LLMs)
AI Reasoning
Prompt Engineering

Top Data Types

Text
Video

Top Task Types

RLHF

Freelancer Overview

AI Trainer with 4+ years of professional experience spanning complex workflows, research, and quality-focused execution. Core strengths include internal and proprietary tooling. Education: Bachelor of Science in Computer Science and Engineering, Rajshahi University of Engineering & Technology (2026). AI-training work focuses on text data and on labeling workflows such as RLHF.

Labeling Experience

AI Trainer

Text, RLHF
As an AI Trainer at Outlier AI & Binary Cognition, I evaluated LLM outputs for reasoning quality, factual accuracy, instruction following, and robustness. I provided Reinforcement Learning from Human Feedback (RLHF) preference feedback and created adversarial test cases to stress-test model behavior. My work produced high-quality annotations and assessments used in fine-tuning pipelines.
• Reviewed and rated AI-generated responses to diverse prompts for quality and accuracy.
• Authored adversarial prompts that challenged model capabilities and edge cases.
• Provided RLHF preference judgments to guide LLM tuning and improvement.
• Collaborated with research teams to enhance annotation guidelines and model reliability.

2022 - Present

Education

Rajshahi University of Engineering & Technology

Bachelor of Science, Computer Science and Engineering

2022 - 2026

Work History

Jazariplex Technologies

AI Growth Automation Engineer

Rajshahi
2026 - Present

Obby

AI Automation Engineer & Founding Member

Rajshahi
2023 - 2025