Jeremiah Souza

AI-Focused Evaluation & Prompt Design

Perry, USA
$20.00/hr
Expert
Clickworker
Data Annotation Tech
Mindrift

Key Skills

Software

Clickworker
Data Annotation Tech
Mindrift
OneForma
Remotasks
Scale AI
Toloka

Top Subject Matter

AI-Generated Code and Text Evaluation
Text generation

Top Data Types

Text
Document
Computer Code Programming

Top Task Types

Text Generation
Computer Programming Coding
Text Summarization
Question Answering
RLHF
Evaluation Rating

Freelancer Overview

I have experience reviewing and improving both text and code outputs, using a structured approach to check for correctness, clarity, and overall quality. I’ve worked on writing prompts, comparing multiple responses, and identifying issues such as missing logic, errors, or unclear explanations. I also create test cases and think through edge cases to make sure outputs are reliable. With over 10 years of software engineering experience, I’m strong at understanding code and spotting problems quickly. I’m detail-oriented, consistent, and comfortable following clear guidelines, even in repetitive tasks. I focus on making sure outputs are accurate, easy to understand, and useful.

English (Expert)

Labeling Experience

AI-Focused Evaluation & Prompt Design

Text, Text Generation
I designed prompts for generating and evaluating code and text outputs, with a focus on both API logic and structured response evaluation. Through this role, I reviewed both human and AI-generated outputs for correctness, clarity, and maintainability, mirroring industry LLM evaluation workflows. I created test cases, ranked responses, and identified edge cases to improve output robustness and quality.

• Crafted and iterated on prompt designs for AI code and text generation tasks
• Systematically evaluated and ranked AI and human responses against rigorous criteria
• Designed and applied test cases and scenarios to validate generated outputs
• Identified hallucinations, missing logic, and ambiguous instructions to refine AI results

2020 - Present

Education

University of Central Florida

Bachelor of Science, Computer Science

2011 - 2015

Work History

Techmd

Senior Software Engineer

Perry
2023 - Present

Reindicator

Full-Stack Software Engineer

N/A
2020 - 2023