Arnav Hazari - AI Development Fellow

Key Skills

Software

Telus

Surge AI

Snorkel AI

Scale AI

Remotasks

Data Annotation Tech

CrowdSource

Mercor

Top Subject Matter

AI model evaluation and text output analysis

Multimodal generative AI quality evaluation

Legal Services & Contract Review

Top Data Types

Text

Document

Audio

Top Task Types

Bounding Box

Text Generation

Question Answering

Text Summarization

RLHF

Fine Tuning

Evaluation Rating

Computer Programming Coding

Transcription

Data Collection

Prompt Response Writing SFT

Freelancer Overview

AI Development Fellow. Brings 2+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Business Administration, University of Georgia (2024). AI-training focus includes data types such as Text and labeling workflows including Evaluation and Rating.

IntermediateEnglish

Labeling Experience

Generative AI Specialist

Text

I evaluated the quality of multimodal AI outputs within production data pipelines, proactively identifying systematic output error patterns. My recommendations were structured for remediation and shared directly with engineering to guide improvements. This process contributed to the establishment of analytical benchmarking standards across the department. • Reviewed and rated over 500 outputs for quality and consistency. • Built frameworks to standardize repeatable evaluation across cross-functional teams. • Created structured recommendations to remediate identified labeling and output errors. • Generated benchmarking data to monitor AI output quality longitudinally.

2026 - Present

AI Development Fellow

Text

I built and implemented a quality evaluation framework to review AI-generated textual outputs, focusing on both precision and edge-case analysis. My work included systematic testing and categorization of AI pipeline errors, culminating in comprehensive diagnostic reporting. These efforts directly influenced the product team's quarterly model evaluation roadmap. • Developed a repeatable workflow that benchmarked over 1,000 textual outputs. • Achieved greater than 98% precision through structured ground-truth analysis. • Identified, annotated, and reported on systematic failure patterns in model outputs. • Collaborated across teams to align annotation findings with engineering improvements.

2025 - Present

Education

U

University of Georgia

Bachelor of Business Administration, Management Information Systems

Bachelor of Business Administration

2024

Work History

O

Office of Undergraduate Research

Student Researcher

Kennesaw

2025 - Present

B

Bagwell Center for Market Analysis

Analyst

Kennesaw

2025 - Present