Shoaib Wani - LLM Evaluator and AI Alignment Analyst - Technology & Internet

Key Skills

Software

Other

iMerit

Scale AI

CVAT

Top Subject Matter

No subject matter listed

Top Data Types

Text

Video

Image

Top Task Types

Evaluation/Rating

Red Teaming

Computer Programming/Coding

Prompt + Response Writing (SFT)

Bounding Box

Classification

Transcription

Freelancer Overview

Education includes Doctor of Philosophy, Jawaharlal Nehru University (2026).

Entry LevelHindiArabicUrduEnglish

Labeling Experience

Computer Vision Annotation Contractor – CVAT Projects

CVATImageBounding Box

As a Computer Vision Annotation Contractor for CVAT Projects, I annotated bounding boxes and segmentation masks for object detection datasets. My work focused on labeling images to assist in the training of computer vision AI. This experience contributed to dataset quality and model precision. • Drew bounding boxes around objects in image datasets • Applied segmentation masks for detailed labeling • Reviewed and validated annotations for quality control • Used CVAT tools to ensure efficient annotation workflows •Reviewed visual data quality and object placement within images used for machine learning datasets.

2023 - 2025

AI Data Annotation Specialist – iMerit

ImeritTextBounding BoxClassification

As an AI Data Annotation Specialist at iMerit, I performed high-accuracy annotation tasks on textual and image data for supervised machine learning projects. I maintained rigorous QA standards and labeled complex edge cases in datasets. My responsibility was to ensure data quality and consistency for effective model training. • Provided precise data labeling for both text and images • Upheld QA standards throughout annotation workflows • Identified and labeled ambiguous or challenging edge cases • Enhanced the accuracy of training data for ML systems

2023

LLM Evaluator – Outlier AI

OtherTextEvaluation RatingRed Teaming

As an LLM Evaluator at Outlier AI, I evaluated and ranked LLM-generated responses according to defined metrics. I authored high-quality gold standard responses and flagged instances of hallucination and bias in outputs. My work supported RLHF pipelines for improving natural language processing models. • Conducted detailed prompt response grading for accuracy and relevance • Developed clear, gold-standard responses for benchmarking • Identified and reported cases of bias and hallucination in model outputs • Collaborated with RLHF teams for model alignment improvements

2023

LLM Evaluation & Dataset QA Analyst – Scale AI

Scale AITextEvaluation Rating

As an LLM Evaluation & Dataset QA Analyst at Scale AI, I was responsible for grading LLM responses and validating gold-label annotations. I wrote and structured JSON prompts and performed preference ranking for conversational models. This work contributed to dataset quality assurance and model improvement. • Graded LLM outputs to assess correctness and coherence • Authored and reviewed JSON prompt-response pairs • Validated gold label consistency across datasets • Ranked and annotated preferences for dialogue systems

2023

High-Precision Singing Voice Synthesis (SVS) Corpus Annotation

AudioTranscription

Managed end-to-end annotation of an English singing dataset for SVS training. Performed phoneme alignment using TextGrid and Praat, labeled f₀ pitch and note transitions, followed GTSinger standards, and validated labels with Python (Librosa, Parselmouth) for DiffSinger model training.

2023 - 2023

Education

J

Jawaharlal Nehru University

Doctor of Philosophy, Artificial Intelligence and Machine Learning

Doctor of Philosophy

2026 - 2025

J

Jamia Millia Islamia

Master of Science, Machine Learning

Master of Science

2023 - 2025

Work History

T

Tata Consultancy Services

Machine Learning Researcher

New Delhi

2024 - 2025

I

Infosys

AI & Data Science Trainee

New Delhi

2022 - 2023