Design and Implementation of Data Annotation Workflows for LLM and RAG Output Evaluation
I designed and implemented human-in-the-loop data annotation workflows to create evaluation datasets for LLM and RAG outputs. Using proprietary annotation tooling, I built processes to assess the retrieval accuracy and response relevance of generative models applied to enterprise document intelligence. These workflows included systematic evaluation and error analysis to reduce hallucinations in contract-analysis AI systems.

• Developed and maintained evaluation datasets for legal and procurement documents
• Collaborated with AI teams to capture model performance metrics and user feedback
• Automated portions of the annotation workflow for efficiency and scalability
• Ensured compliance with data privacy and security standards throughout the annotation process
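As a minimal illustration of the kind of metrics such an evaluation workflow produces, the sketch below computes retrieval accuracy from annotator judgments and Cohen's kappa for inter-annotator agreement. The function names, record schema, and labels are hypothetical, not taken from the actual tooling described above.

```python
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    """Inter-annotator agreement (Cohen's kappa) between two annotators.

    Compares observed agreement against the agreement expected by chance
    given each annotator's label distribution.
    """
    assert len(labels_a) == len(labels_b) and labels_a
    n = len(labels_a)
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[l] * counts_b.get(l, 0) for l in counts_a) / (n * n)
    return 1.0 if expected == 1 else (observed - expected) / (1 - expected)

def retrieval_accuracy(records):
    """Fraction of annotated examples whose retrieved passage was judged relevant."""
    return sum(r["retrieval_relevant"] for r in records) / len(records)

# Hypothetical annotation records for RAG outputs (schema is illustrative).
records = [
    {"retrieval_relevant": True,  "response_score": 4},
    {"retrieval_relevant": True,  "response_score": 5},
    {"retrieval_relevant": False, "response_score": 2},
]

print(retrieval_accuracy(records))                       # 2 of 3 judged relevant
print(cohens_kappa(["rel", "rel", "irr", "rel"],
                   ["rel", "irr", "irr", "rel"]))        # agreement beyond chance
```

In practice, agreement scores like kappa help decide when annotation guidelines need revision, while per-example labels feed the error analysis that targets hallucinations.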