For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
A
Alexis Cuevas

Alexis Cuevas

Independent AI Researcher (Causal Mechanism Design & RLHF Tuning)

USA flagBrazoria, Usa
$19.00/hrIntermediateLabelboxSuperannotate

Key Skills

Software

LabelboxLabelbox
SuperAnnotateSuperAnnotate

Top Subject Matter

AI/ML Research
Causal Reasoning
Model Safety

Top Data Types

TextText
Computer Code ProgrammingComputer Code Programming
DocumentDocument

Top Task Types

RLHFRLHF
Red TeamingRed Teaming
Fine-tuningFine-tuning

Freelancer Overview

Independent AI Researcher (Causal Mechanism Design & RLHF Tuning). Brings 2+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Hugging Face Transformers. AI-training focus includes data types such as Text and labeling workflows including RLHF.

IntermediateEnglishSpanish

Labeling Experience

Independent AI Researcher (Causal Mechanism Design & RLHF Tuning)

TextRLHF
I led independent AI research projects focused on designing and evaluating causal mechanisms in neural architectures. My work involved iterative AI training, including aligning generative models and evaluating their performance via benchmarks. I implemented RLHF-based feedback to refine LLM outputs and research prototypes. • Engineered a 114M parameter Differential Attention head evaluated on Diagnosis Arena. • Performed evaluation and red-teaming of LLMs for safety and causal refusal ("Causal Bomb" detection). • Utilized Hugging Face Transformers and RLHF methodologies for training/feedback cycles. • Documented alignment strategies and efficiency benchmarking of SOTA research models.

I led independent AI research projects focused on designing and evaluating causal mechanisms in neural architectures. My work involved iterative AI training, including aligning generative models and evaluating their performance via benchmarks. I implemented RLHF-based feedback to refine LLM outputs and research prototypes. • Engineered a 114M parameter Differential Attention head evaluated on Diagnosis Arena. • Performed evaluation and red-teaming of LLMs for safety and causal refusal ("Causal Bomb" detection). • Utilized Hugging Face Transformers and RLHF methodologies for training/feedback cycles. • Documented alignment strategies and efficiency benchmarking of SOTA research models.

2023 - Present

Education

C

Cuesta College

Associate of Science , Information Systems

Associate of Science
2015 - 2018

Work History

B

Brazoria Network Solutions

Systems Diagnostic Technician

Brazoria
2022 - 2023
P

Playdate PDX

website maintenance and updating, networking and installation

Portland
2019 - 2023