Marcus Silva

LLM Output Evaluation and Prompt Testing (Klaviyo)

Sao Paulo, Brazil
$50.00/hr, Expert

Key Skills

Software

No software listed

Top Subject Matter

LLM output evaluation in CRM/enterprise automation
Legal Services & Contract Review
Regulatory Compliance & Risk Analysis

Top Data Types

Text
Document

Top Task Types

Computer Programming/Coding

Freelancer Overview

Specializes in LLM output evaluation and prompt testing (Klaviyo). Brings 12+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include internal and proprietary tooling. Holds a Bachelor of Science from Universidade Estadual de Campinas (Unicamp), completed in 2015. AI-training work centers on text data and on evaluation and rating workflows.

English (Expert)

Labeling Experience

LLM Output Evaluation and Prompt Testing (Klaviyo)

Text
Led the implementation of prompt versioning, evaluation frameworks, and hallucination mitigation strategies for deterministic AI outputs in production. Worked on optimizing the quality and reliability of outputs from large language models (LLMs) through structured testing and evaluation. Integrated output quality steps directly into live CRM workflows using custom pipeline deployments.

• Designed and executed evaluation frameworks for model output validation
• Performed systematic rating and prompt testing of LLM results
• Oversaw hallucination detection and mitigation labeling workflows
• Mentored teams on responsible AI evaluation and tuning

2021 - 2026

Education

Universidade Estadual de Campinas (Unicamp)

Bachelor of Science
2011 - 2015

Work History

Klaviyo

Senior AI / Full Stack Engineer

Boston
2021 - 2026

Firework

Full Stack / AI Developer

Sao Paulo
2018 - 2021