For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
H
Hasnain Mubashir

Hasnain Mubashir

Team Lead - Terminal Bench 2.0 (Harbor) | QA Reviewer | LLM Trainer

Pakistan flagLahore, Pakistan
$20.00/hrIntermediateInternal Proprietary Tooling

Key Skills

Software

Internal/Proprietary Tooling

Top Subject Matter

LLM Evaluation
Qa Domain Expertise
AI Training

Top Data Types

TextText
DocumentDocument
Computer Code ProgrammingComputer Code Programming

Top Task Types

RLHFRLHF
Computer Programming/CodingComputer Programming/Coding
Function CallingFunction Calling

Freelancer Overview

Team Lead - Terminal Bench 2.0 (Harbor) | QA Reviewer | LLM Trainer. Brings 4+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Science, GCU Lahore (2023). AI-training focus includes data types such as Text and labeling workflows including Evaluation and Rating.

IntermediateEnglish

Labeling Experience

Team Lead - Terminal Bench 2.0 (Harbor) | QA Reviewer | LLM Trainer

Text
Oversaw a team in conducting rubric-based evaluation and review tasks for benchmark-oriented LLM projects. Performed detailed prompt and response evaluations, including red-teaming, instruction-following assessment, and edge-case identification for LLM training pipelines. Contributed actively to prompt engineering, annotation, response ranking, and training data refinement workflows. • Maintained and elevated team consistency in LLM quality assurance and review standards. • Reviewed 30+ LLM coding and reasoning tasks per week for accuracy, safety, completeness, and reasoning quality. • Identified hidden failure modes, adversarial scenarios, and performed evaluator calibration across projects. • Liaised with stakeholders to balance throughput targets with rigorous rubric and output criteria.

Oversaw a team in conducting rubric-based evaluation and review tasks for benchmark-oriented LLM projects. Performed detailed prompt and response evaluations, including red-teaming, instruction-following assessment, and edge-case identification for LLM training pipelines. Contributed actively to prompt engineering, annotation, response ranking, and training data refinement workflows. • Maintained and elevated team consistency in LLM quality assurance and review standards. • Reviewed 30+ LLM coding and reasoning tasks per week for accuracy, safety, completeness, and reasoning quality. • Identified hidden failure modes, adversarial scenarios, and performed evaluator calibration across projects. • Liaised with stakeholders to balance throughput targets with rigorous rubric and output criteria.

2025 - Present

Education

G

GCU Lahore

Bachelor of Science, Electronics

Bachelor of Science
2019 - 2023

Work History

R

Raven Tech

Full Stack Developer

Lahore
2023 - 2025
R

Raven Tech

Frontend Developer

Lahore
2022 - 2023