Prajwal _patil - AI Dataset Labeling Specialist – High-Performance Low-Latency RAG System

Key Skills

Software

Other

Top Subject Matter

AI Retrieval Systems

Speech-to-Text Model Evaluation

Critical Reasoning and Prompt Engineering

Top Data Types

Document

Audio

Image

Top Task Types

Classification

Transcription

Freelancer Overview

AI Dataset Labeling Specialist – High-Performance Low-Latency RAG System. Brings 1+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal, Proprietary Tooling, and Other. Education includes Bachelor of Engineering, Yeshwantrao Chavan College of Engineering (2027) and Diploma, Government Polytechnic, Nagpur (2024). AI-training focus includes data types such as Document and Audio and labeling workflows including Classification and Transcription.

Intermediate, English, Hindi

Labeling Experience

Audio Data Labeling & Transcription Specialist – Multimodal Audio/Video Annotation System

Other, Audio, Transcription

Processed and labeled multimodal and multilingual audio datasets for speech-to-text model evaluation, focusing on noisy audio recordings. Applied transcription guidelines and performed validation checks to maintain high dataset reliability and accuracy. Reduced manual transcription workload by optimizing the annotation process and improving transcription efficiency.

• Achieved 78% accuracy on transcriptions of noisy audio data.
• Utilized Whisper, PyTorch, and WER/CER tools for evaluation and labeling.
• Detected and flagged inconsistencies to improve overall dataset quality.
• Enhanced workflow efficiency within the transcription pipeline.

2023 - 2024

AI Dataset Labeling Specialist – High-Performance Low-Latency RAG System

Document, Classification

Structured and categorized large document datasets for an AI retrieval system using classification pipelines and tagging techniques. Evaluated dataset quality and flagged inconsistencies during model testing to improve retrieval accuracy. Applied classification frameworks and annotation guidelines to ensure consistency and reliability in labeled data.

• Used classification pipelines for systematic dataset organization.
• Enhanced retrieval model accuracy by meticulously refining dataset labels.
• Worked with LLMs, LangChain, Vector DBs, and Python-based tools.
• Recognized for dataset quality by government and industry experts.

2023 - 2023

Education

Yeshwantrao Chavan College of Engineering

Bachelor of Engineering, Computer Technology

2024 - 2027

Government Polytechnic, Nagpur

Diploma, Artificial Intelligence and Machine Learning

2021 - 2024

Work History

Unified Mentor

Data Analyst Intern

Remote

2024 - 2024