For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
M

Michael Botkin

Designer and Implementer of Data Annotation Workflow for LLM and RAG Output Evaluation

USA flagDallas, Usa
$45.00/hrExpertEncordLabelboxLabel Studio

Key Skills

Software

EncordEncord
LabelboxLabelbox
Label StudioLabel Studio
CVATCVAT
DatasaurDatasaur
DoccanoDoccano
Scale AIScale AI
AppenAppen

Top Subject Matter

Enterprise RAG & Document Intelligence
Retail Vendor Contracts
Revenue Prediction and Anomaly Detection for Market Intelligence

Top Data Types

DocumentDocument
TextText
ImageImage

Top Task Types

Data CollectionData Collection
RLHFRLHF
Text SummarizationText Summarization
Object DetectionObject Detection
Bounding BoxBounding Box

Freelancer Overview

Designer and Implementer of Data Annotation Workflow for LLM and RAG Output Evaluation. Brings 13+ years of professional experience across legal operations, contract review, compliance, and structured analysis. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Science, Virginia Tech (2013). AI-training focus includes data types such as Document and labeling workflows including Evaluation, Rating, and Data Collection.

ExpertEnglish

Labeling Experience

Designer and Implementer of Data Annotation Workflow for LLM and RAG Output Evaluation

Document
I designed and implemented human-in-the-loop data annotation workflows to create evaluation datasets for LLM and RAG outputs. Using internal or proprietary annotation tooling, I built processes to assess retrieval accuracy and response relevance of generative models applied to enterprise document intelligence. These workflows included systematic evaluation and error analysis to reduce hallucinations in contract analysis AI systems. • Developed and maintained evaluation datasets for legal and procurement documents • Collaborated with AI teams to capture model performance metrics and user feedback • Automated portions of the annotation workflow for efficiency and scalability • Ensured compliance with data privacy and security standards throughout annotation

I designed and implemented human-in-the-loop data annotation workflows to create evaluation datasets for LLM and RAG outputs. Using internal or proprietary annotation tooling, I built processes to assess retrieval accuracy and response relevance of generative models applied to enterprise document intelligence. These workflows included systematic evaluation and error analysis to reduce hallucinations in contract analysis AI systems. • Developed and maintained evaluation datasets for legal and procurement documents • Collaborated with AI teams to capture model performance metrics and user feedback • Automated portions of the annotation workflow for efficiency and scalability • Ensured compliance with data privacy and security standards throughout annotation

2025 - Present

Dataset Preparation and Validation for ML Models in Financial Analytics

DocumentData Collection
I partnered with data science teams to prepare and validate labeled datasets for machine learning models in revenue prediction and anomaly detection. My role included collecting, curating, and reviewing labeled data to improve the quality of supervised learning for production models. The datasets enabled the evaluation and reliability of deployed AI features in financial analytics tools. • Coordinated with cross-functional teams to define labeling guidelines • Performed quality checks and validation on labeled financial data • Iteratively refined datasets based on model performance and feedback • Documented labeling processes for ongoing improvement and knowledge transfer

I partnered with data science teams to prepare and validate labeled datasets for machine learning models in revenue prediction and anomaly detection. My role included collecting, curating, and reviewing labeled data to improve the quality of supervised learning for production models. The datasets enabled the evaluation and reliability of deployed AI features in financial analytics tools. • Coordinated with cross-functional teams to define labeling guidelines • Performed quality checks and validation on labeled financial data • Iteratively refined datasets based on model performance and feedback • Documented labeling processes for ongoing improvement and knowledge transfer

2022 - 2025

Education

V

Virginia Tech

Bachelor of Science, Computer Science

Bachelor of Science
2009 - 2013

Work History

A

Anblicks

Senior Full Stack Engineer

Dallas
2025 - Present
A

AirDNA

Senior Software Engineer

Denver
2022 - 2025