Lopez Bernardino - Senior Machine Learning Engineer – AI Dataset Design and Verification

Key Skills

Software

No software listed

Top Subject Matter

Stem Domain Expertise

Machine Learning

AI Reasoning

Top Data Types

Text

Document

Top Task Types

Prompt Response Writing SFT

Text Generation

Freelancer Overview

Statistical Modeling Framework Author for AI Training. Core strengths include Internal, Proprietary Tooling, and OpenAI API. AI-training focus includes data types such as Computer Code and Programming and labeling workflows including Text Generation, Evaluation, and Rating.

ExpertEnglish

Labeling Experience

ML Problem Design Suite – AI Reasoning Benchmark Dataset Authoring

TextPrompt Response Writing SFT

I designed a library of computationally intensive STEM and ML problems for advanced reasoning and coding skill evaluation, with full documentation and validation. These datasets targeted AI benchmarking and model reasoning tasks, ensuring all prompts and solutions adhered to advanced reasoning requirements. My efforts supported internal and external benchmarking of AI model capabilities through comprehensive scenario coverage.• Authored 80+ ML/STEM problem-solution pairs for AI evaluation • Ensured dataset quality through rigorous prompt validation workflows • Applied Python stack for problem coding and testing • Produced all materials with high-standard technical documentation

2023 - Present

Senior Machine Learning Engineer – AI Dataset Design and Verification

Text

I designed and verified computationally intensive STEM and ML problem sets for internal AI training datasets at an advanced reasoning level. This included reviewing and validating problem-solution pairs, integrating generative AI tools for automated quality verification, and authoring thorough technical documentation. My responsibilities ensured high scientific standards and reproducibility across over 100 unique dataset entries for AI model training and evaluation.• Created, validated, and documented problem sets for AI model reasoning • Led end-to-end workflow from design to final dataset verification • Integrated GenAI tools (OpenAI API, IBM Watson, LangChain) for quality control • Authored and maintained solution write-ups for cross-team dataset review

2021 - Present

Generative AI-Assisted Problem Verification Pipeline – Dataset Quality Assurance

Text

I built an automated problem-verification pipeline that used the OpenAI API and Python scripting to cross-check, rate, and validate ML solution quality for internal training datasets. This pipeline flagged inconsistencies, validated claims, and generated structured documentation to maintain data quality. My implementation reduced review time and improved reliability for AI training problem sets.• Automated statistical and factual claim validation in datasets • Integrated GenAI tools for problem and solution verification • Produced structured, peer-reviewed documentation for problem sets • Supported dataset maintenance aligned to scientific rigor standards

2022 - 2022

Machine Learning Engineer – Curriculum Data Labeling and Problem Authoring

TextPrompt Response Writing SFT

I designed and wrote computationally challenging Python problem sets for internal machine learning training and onboarding curricula. These problem sets were used to assess both practical and theoretical ML proficiency across a range of data science tools. The process included writing prompts, solutions, and assessment guidelines to train and test both human engineers and AI models in various tasks.• Developed and validated advanced problem prompts and solutions • Created internal curricula for ML skill evaluation and training • Authored comprehensive documentation for reproducibility and QA • Used GenAI tools to automate and enhance prompt creation workflows

2018 - 2021

Statistical Modeling Framework Author for AI Training

Text Generation

Developed a Python statistical modeling framework with built-in documentation generators for predictive analytics training. The framework was adopted by multiple product teams for research and educational use. Generated C1-level English technical write-ups automatically from model results for AI training purposes. • Created reusable modules for time-series forecasting and anomaly detection. • Automated scientific documentation for education and reproducibility. • Facilitated internal team adoption of standardized modeling tools. • Enhanced access to AI training content through statistical tooling.

2019 - 2020

Education

U

University of Texas at Austin

Bachelor of Science, Computer Science

Bachelor of Science

2012 - 2016

Work History

I

IBM

Senior Machine Learning Engineer

Austin, TX

2021 - Present

D

Dell Technologies

Machine Learning Engineer

Round Rock, TX

2018 - 2021