For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Tatiana Gutierrez

Tatiana Gutierrez

Full Stack Software Developer | Data & AI Solutions

Colombia flagBogotá, Colombia
$28.00/hrExpertCVATGoogle Cloud Vertex AILabelbox

Key Skills

Software

CVATCVAT
Google Cloud Vertex AIGoogle Cloud Vertex AI
LabelboxLabelbox
Label StudioLabel Studio
LionbridgeLionbridge
RoboflowRoboflow
Internal/Proprietary Tooling

Top Subject Matter

No subject matter listed

Top Data Types

Computer Code ProgrammingComputer Code Programming
DocumentDocument
TextText

Top Task Types

Computer Programming Coding
Evaluation Rating
Function Calling
Prompt Response Writing SFT
Question Answering
Red Teaming
RLHF
Text Generation
Text Summarization

Freelancer Overview

I have experience working with AI training data, model evaluation, and data preparation as part of software and AI-driven solutions. My background combines software development, data analytics, and AI agent integration, allowing me to understand how high-quality labeled data directly impacts model performance. I have worked with both structured and unstructured data, supporting tasks such as data validation, classification, prompt evaluation, and output quality analysis. In addition to hands-on data work, I have built and integrated AI-powered tools, chatbots, and automation workflows using modern LLM frameworks. This gives me a strong understanding of annotation guidelines, edge cases, bias detection, and consistency, ensuring training data is accurate, scalable, and aligned with real-world use cases.

ExpertEnglishSpanishPortuguese

Labeling Experience

LLM Fine-Tuning and Safety Evaluation for Dialogue Systems

OtherTextEntity Ner ClassificationClassification
Executed a large-scale project to fine-tune and improve the safety and performance of a large language model (LLM). Scope included analyzing and labeling complex textual prompts and model responses across diverse domains to identify errors, biases, and unsafe outputs. Developed and adhered to detailed annotation guidelines for quality and consistency. Tasks involved writing and evaluating high-quality prompt-response pairs for supervised fine-tuning (SFT), classifying undesirable content, and performing red-teaming to proactively discover potential model failures. Processed thousands of data samples, contributing directly to the enhancement of the model's dialogue capabilities and alignment with safety protocols.

Executed a large-scale project to fine-tune and improve the safety and performance of a large language model (LLM). Scope included analyzing and labeling complex textual prompts and model responses across diverse domains to identify errors, biases, and unsafe outputs. Developed and adhered to detailed annotation guidelines for quality and consistency. Tasks involved writing and evaluating high-quality prompt-response pairs for supervised fine-tuning (SFT), classifying undesirable content, and performing red-teaming to proactively discover potential model failures. Processed thousands of data samples, contributing directly to the enhancement of the model's dialogue capabilities and alignment with safety protocols.

2025 - 2025

LLM Evaluation, Prompt Engineering, and AI Agent Training

Internal Proprietary ToolingTextEvaluation RatingComputer Programming Coding
Worked on AI-driven projects involving LLMs, chatbots, and automation tools, contributing to AI training and evaluation workflows. The project focused on improving model reliability, usability, and alignment with real-world user interactions. Performed tasks including prompt and response writing for supervised fine-tuning, evaluation and rating of LLM outputs, and validation of chatbot responses used in web and automation environments. Reviewed outputs for accuracy, coherence, bias, and hallucinations, providing structured feedback to improve model behavior. Additionally supported function-calling workflows and coding-related tasks, ensuring model outputs aligned with expected logic and system constraints. Followed strict quality guidelines to maintain consistency, edge case coverage, and high annotation accuracy across datasets.

Worked on AI-driven projects involving LLMs, chatbots, and automation tools, contributing to AI training and evaluation workflows. The project focused on improving model reliability, usability, and alignment with real-world user interactions. Performed tasks including prompt and response writing for supervised fine-tuning, evaluation and rating of LLM outputs, and validation of chatbot responses used in web and automation environments. Reviewed outputs for accuracy, coherence, bias, and hallucinations, providing structured feedback to improve model behavior. Additionally supported function-calling workflows and coding-related tasks, ensuring model outputs aligned with expected logic and system constraints. Followed strict quality guidelines to maintain consistency, edge case coverage, and high annotation accuracy across datasets.

2023 - 2025
CVAT

Data labeling for image recognition

CVATImageClassificationObject Detection
Labeled product images for an AI model aimed at improving image recognition for inventory management. Tagged items based on categories, colors, sizes, and other features to help the system identify products in stock, aiding in automated restocking processes and improving the customer shopping experience.

Labeled product images for an AI model aimed at improving image recognition for inventory management. Tagged items based on categories, colors, sizes, and other features to help the system identify products in stock, aiding in automated restocking processes and improving the customer shopping experience.

2023 - 2024
Labelbox

Data labeling for AI Model

LabelboxTextComputer Programming CodingPrompt Response Writing SFT
Labeled and categorized text-based user interaction data in the e-commerce industry, focusing on tagging preferences, behaviors, and relevant keywords. This labeling process was essential for training a recommendation model that improves the accuracy of personalized product suggestions for users, enhancing their shopping experience.

Labeled and categorized text-based user interaction data in the e-commerce industry, focusing on tagging preferences, behaviors, and relevant keywords. This labeling process was essential for training a recommendation model that improves the accuracy of personalized product suggestions for users, enhancing their shopping experience.

2023 - 2023
Prodigy

Speech recognition

ProdigyAudioClassificationAudio Recording
Labeled medical audio data, including doctor-patient conversations, to train speech recognition models for transcribing medical notes. Focused on labeling medical terminology, symptoms, conditions, and patient histories to ensure the system could accurately transcribe and categorize the information for use in Electronic Health Records (EHR).

Labeled medical audio data, including doctor-patient conversations, to train speech recognition models for transcribing medical notes. Focused on labeling medical terminology, symptoms, conditions, and patient histories to ensure the system could accurately transcribe and categorize the information for use in Electronic Health Records (EHR).

2022 - 2022

Education

U

Universidad Jorge Tadeo Lozano

Bachelor of Science, Data Science and Computational Simulation

Bachelor of Science
2021 - 2024
U

Universidad Jorge Tadeo Lozano

Bachelor of Science (B.Sc.), Data Science & Computational Simulation

Bachelor of Science (B.Sc.)
2021 - 2024

Work History

B

Banco Davivienda S.A.

Software Developer | Full Stack, Data & AI Solutions

Bogotá
2023 - Present
B

Banco Davivienda

Data Science & Automation Intern

Bogotá
2025 - Present