Gonzalo Montero Cavero - AI Training / Evaluation Specialist (Freelance)

Key Skills

Software

No software listed

Top Subject Matter

General AI behavior

task evaluation

ambiguity analysis

Top Data Types

Text

Document

Image

Top Task Types

Evaluation/Rating

Question Answering

Text Summarization

Prompt + Response Writing (SFT)

Classification

Text Generation

Data Collection

Freelancer Overview

AI Training / Evaluation Specialist (Freelance). Brings 7+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Bachelor of Architecture, Peruvian University of Applied Sciences (2021). AI-training focus includes data types such as Text and labeling workflows including Evaluation and Rating.

IntermediateEnglish

Labeling Experience

AI Training / Evaluation Specialist (Freelance)

Text

I evaluated AI-generated outputs by analyzing multi-step tasks for logical errors and inconsistencies. I structured and tested scenarios to assess AI system responses under ambiguous conditions and evolving input contexts. My work focused on iteratively applying classification rules and refining AI outputs for accuracy and reliability. • Designed repeatable evaluation protocols and structured analysis frameworks. • Identified and documented failure cases, reasoning gaps, and ambiguous outputs. • Collaborated with AI systems for multiple interaction rounds and output refinement. • Ensured feedback enabled continuous improvement in AI performance.

2026 - Present

Independent AI Scenario Designer / Evaluator

Text

I led an independent project to analyze and evaluate AI behavior across repeated interactions, changing inputs, and complex task scenarios. Using small-scale Python prototypes and structured logic, I recorded and assessed outputs for repeatability and traceability. The work emphasized systematic scenario construction for testing AI under real-world ambiguity and constraints. • Developed reproducible artifacts and validation protocols based on post-execution analysis. • Organized and structured outputs for consistent comparison and reproducibility. • Identified patterns, inconsistencies, and failure cases through iterative interactions. • Utilized Python scripting and GitHub for output recording and structured experimentation.

2025 - Present

Education

P

Peruvian University of Applied Sciences

Bachelor of Architecture, Architecture

Bachelor of Architecture

2021 - 2021

Work History

A

Arandela

Founder and Architect

Lima

2021 - 2025

T

Techo Propio Program

Construction Supervisor

Lima

2019 - 2021