Key Skills

Software

Mindrift

Top Subject Matter

AI Alignment

Model Evaluation

Safety Auditing

Top Data Types

Text

Video

Top Task Types

RLHF

Freelancer Overview

I am a Senior AI Quality Auditor and Forensic Analyst specializing in RLHF and model alignment for frontier LLMs, focused on the clinical evaluation of high-complexity reasoning tasks. My expertise lies in Chain-of-Thought auditing, verifying the logical validity of each step in a model’s reasoning process, ensuring process-correctness instead of simple outcome-accuracy. What sets me apart is a forensic approach to RAG verification and Adversarial Red-Teaming. Leveraging a background in Behavioral Science, I identify subtle hallucinations of omission, source-misattributions, and linguistic biases that standard automated filters miss. I am proficient in technical formatting (Markdown, LaTeX), data automation (Python), and the development of granular grading rubrics. My goal is to provide invisible efficiency: high-fidelity, zero-friction data that optimizes the signal-to-noise ratio for model safety and reasoning

Intermediate

Labeling Experience

Multimodal Red-Teaming & Adversarial Auditing

VideoRed Teaming

Specialized in the adversarial stress-testing of Vision-Language Models (VLM) to identify systemic vulnerabilities in multimodal perception and alignment. Focused on bypassing safety guardrails and identifying logical breakpoints where visual and textual inputs conflict. - Multimodal Jailbreak Identification: Developed and executed complex adversarial prompts designed to bypass text-based safety filters through visual injection. This included testing the model’s susceptibility to "hidden" instructions within images (OCR-based exploits) and contradictory visual-textual pairings that could trigger prohibited outputs. - Adversarial Perception Testing: Systematically identified "optical hallucinations" by presenting the model with high-complexity visual scenes, ambiguous spatial relationships, and adversarial perturbations. I focused on forcing failures in object-relationship grounding—specifically where the model would "see" non-existent safety hazards or overlook critical visual context. - Behavioral Bias & Safety Auditing: Leveraged a Behavioral Science framework to red-team for subtle demographic and cultural biases triggered by visual cues. I audited for harmful stereotyping in image-to-text generation, ensuring that model-generated descriptions remained objective and neutralized "reflexive" biases inherent in the training data. - Vulnerability Mapping: Provided detailed forensic reports on "model drift" when processing edge-case visual data, such as low-light technical schematics or distorted medical imagery, where a failure in perception could lead to high-stakes logical errors in downstream RAG (Retrieval-Augmented Generation) tasks.

2025 - Present

Acoustic-to-Semantic Alignment & Pragmatic Auditing

AudioTranscription

Pragmatic Intent Mapping: Specialized in high-fidelity transcription focused on capturing prosodic cues to distinguish between satirical, earnest, and sarcastic intents. Paralinguistic Feature Detection: Systematic identification of pitch, cadence, and emphasis to ensure semantic fidelity in sentiment-sensitive datasets, preventing "literal-read" errors in model training. Acoustic Hallucination Mitigation: Detection and correction of phonetic misinterpretations involving technical jargon, regional accents, and industry-specific terminology. Contextual Hardening: Cross-referencing proper nouns and technical specifications to ensure "ground truth" reliability for high-stakes RAG (Retrieval-Augmented Generation) applications.

2024 - Present

High-Fidelity Text RLHF

MindriftTextRLHF

Gold-Standard Calibration: Top-tier contributor (top 5%) providing validated "ground truth" data for frontier model Reward Model (RM) and Supervised Fine-Tuning (SFT) streams. Forensic Error Attribution: Systematic mapping of model failures to specific causal factors, differentiating between retrieval-based hallucinations and procedural logic breakdowns. Chain-of-Thought (CoT) Validation: Step-by-step auditing of internal reasoning paths for mathematical, logical, and technical prompts to ensure process integrity over outcome-only accuracy. Senior Review & Rubric Alignment: High-fidelity auditing of entry-level datasets to resolve semantic ambiguities and maintain maximum signal-to-noise ratios in large-scale data pipelines. Information Density Optimization: Synthesis and audit of high-utility semantic summaries, balancing factual "purity" with technical SEO constraints and strict structural requirements.

2024 - Present

Education

No Education added yet

David H. hasn’t added any Education History to their OpenTrain profile yet.

Work History

C

Comet Ping Pong

Bar Manager

Washington, DC

2019 - 2024

B

Boone Saloon

General Manager

Boone, NC

2009 - 2019