For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Sayeem Khan

Sayeem Khan

Software Engineer in Contract Review, Compliance, and Legal Research

BANGLADESH flag
Dhaka, Bangladesh
$15.00/hrExpertAnno MageAws SagemakerAppen

Key Skills

Software

Anno-MageAnno-Mage
AWS SageMakerAWS SageMaker
AppenAppen
ArgillaArgilla
Axiom AI
CloudFactoryCloudFactory
CrowdSourceCrowdSource
DataloopDataloop
HumanaticHumanatic
LabelImgLabelImg
LightTagLightTag
LionbridgeLionbridge
MercorMercor
Mighty AIMighty AI
OpenCV AI Kit (OAK)OpenCV AI Kit (OAK)
Redbrick AIRedbrick AI
RoboflowRoboflow
Scale AIScale AI
Snorkel AISnorkel AI
SuperviselySupervisely
TagtogTagtog
TolokaToloka
VoTT
V7 LabsV7 Labs

Top Subject Matter

Legal Services & Contract Review
Regulatory Compliance & Risk Analysis
Legal Research & Document Analysis

Top Data Types

Computer Code ProgrammingComputer Code Programming
TextText
DocumentDocument

Top Task Types

Bounding Box
Classification
Point Key Point
Cuboid
Text Summarization
RLHF
Fine Tuning
Transcription
Computer Programming Coding
Data Collection
Function Calling
Prompt Response Writing SFT
Text Generation
Object Detection
Polyline
Segmentation

Freelancer Overview

Software Engineer in Contract Review, Compliance, and Legal Research. Brings 7+ years of professional experience across complex professional workflows, research, and quality-focused execution. Education includes Bachelor of Science, American International University-Bangladesh (2026) and Higher Secondary Certificate, Scholars School and College (2021).

ExpertBengaliEnglish

Labeling Experience

Text Sentiment & Category Labeling Dataset

TextClassification
Annotated 500 user-generated product reviews with two independent label types: a three-class sentiment label (positive, neutral, negative) and one of eight topic category labels (electronics, clothing, home & kitchen, etc.). Applied structured annotation guidelines to handle edge cases including sarcasm, mixed sentiment, and ambiguous categories. Performed a 10% re-labeling QA pass achieving ~94% inter-annotator agreement. Delivered a structured CSV dataset with confidence scores and quality flags, simulating real-world text classification workflows.

Annotated 500 user-generated product reviews with two independent label types: a three-class sentiment label (positive, neutral, negative) and one of eight topic category labels (electronics, clothing, home & kitchen, etc.). Applied structured annotation guidelines to handle edge cases including sarcasm, mixed sentiment, and ambiguous categories. Performed a 10% re-labeling QA pass achieving ~94% inter-annotator agreement. Delivered a structured CSV dataset with confidence scores and quality flags, simulating real-world text classification workflows.

2025 - Present

Street Scene Image Bounding Box Annotation

ImageBounding Box
Annotated 120 street-scene images with bounding boxes across 6 object classes (car, person, bicycle, traffic sign, truck, bus) using LabelImg, producing 998 total labeled objects. Followed professional annotation guidelines covering occlusion handling, truncation flags, crowd instances, and bounding box tightness standards. Exported annotations in both PASCAL VOC (XML) and YOLO (TXT) formats. Conducted a second-pass QA review on 10% of images using an IoU threshold of ≥0.85, replicating object detection annotation pipelines.

Annotated 120 street-scene images with bounding boxes across 6 object classes (car, person, bicycle, traffic sign, truck, bus) using LabelImg, producing 998 total labeled objects. Followed professional annotation guidelines covering occlusion handling, truncation flags, crowd instances, and bounding box tightness standards. Exported annotations in both PASCAL VOC (XML) and YOLO (TXT) formats. Conducted a second-pass QA review on 10% of images using an IoU threshold of ≥0.85, replicating object detection annotation pipelines.

2024 - Present

RLHF AI Response Preference Ranking Dataset

TextRLHF
Evaluated and ranked 200 AI-generated response pairs across 4 quality dimensions — accuracy, helpfulness, clarity, and safety — using a structured 1–5 scoring rubric. Assigned overall preference labels (A better, B better, tie, both bad) to each pair across 5 prompt categories including factual Q&A, creative writing, how-to instructions, coding, and open-ended conversation. Conducted a 15% blind re-evaluation pass achieving ~88% agreement rate. Replicates the human feedback annotation workflow used by Scale AI and Surge AI for LLM alignment training.

Evaluated and ranked 200 AI-generated response pairs across 4 quality dimensions — accuracy, helpfulness, clarity, and safety — using a structured 1–5 scoring rubric. Assigned overall preference labels (A better, B better, tie, both bad) to each pair across 5 prompt categories including factual Q&A, creative writing, how-to instructions, coding, and open-ended conversation. Conducted a 15% blind re-evaluation pass achieving ~88% agreement rate. Replicates the human feedback annotation workflow used by Scale AI and Surge AI for LLM alignment training.

2023 - Present

Education

A

American International University-Bangladesh

Bachelor of Science, Computer Science and Engineering

Bachelor of Science
2022 - 2026
S

Scholars School and College

Higher Secondary Certificate, Science

Higher Secondary Certificate
2021 - 2021

Work History

K

Kona Software Lab

Software Engineer

Dhaka
2024 - Present
T

Therap

Junior Software Engineer

Dhaka
2022 - 2024