For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
R

Richard V

LLM Output Evaluator and Prompt Engineer

USA flag
Houston, Usa
$20.00/hrExpertClickworkerSurge AIAppen

Key Skills

Software

ClickworkerClickworker
Surge AISurge AI
AppenAppen

Top Subject Matter

AI/Natural Language Processing
AI/Image Generation
AI/Video Generation

Top Data Types

TextText
ImageImage
VideoVideo
DocumentDocument

Top Task Types

Prompt Response Writing SFT

Freelancer Overview

LLM Output Evaluator and Prompt Engineer. Brings 15+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Certificate in Cybersecurity, ISC2 (2023). AI-training focus includes data types such as Text, Image, and Video and labeling workflows including Evaluation, Rating, and Prompt + Response Writing (SFT).

ExpertEnglish

Labeling Experience

AI Video Prompting and Model Evaluation Specialist

VideoPrompt Response Writing SFT
Developed and tested structured prompts for AI video generation models including Flow, Higgsfield AI, Veo, Seedance, and Kling. Performed iterative refinement of shot planning, key framing, and output assessment to ensure high-quality, consistent AI-generated video content. Supplied comprehensive documentation and analysis to inform adjustments in video generation strategies and workflows. • Created and managed prompts for scene design, pacing, and transition in video AI models. • Evaluated model reliability, adherence to prompt instructions, and visual storytelling quality. • Provided structured feedback loops for improving AI model outputs and processes. • Documented findings to support video model training and prompt engineering guidelines.

Developed and tested structured prompts for AI video generation models including Flow, Higgsfield AI, Veo, Seedance, and Kling. Performed iterative refinement of shot planning, key framing, and output assessment to ensure high-quality, consistent AI-generated video content. Supplied comprehensive documentation and analysis to inform adjustments in video generation strategies and workflows. • Created and managed prompts for scene design, pacing, and transition in video AI models. • Evaluated model reliability, adherence to prompt instructions, and visual storytelling quality. • Provided structured feedback loops for improving AI model outputs and processes. • Documented findings to support video model training and prompt engineering guidelines.

2024 - Present

Image Generation Prompt Engineer and Evaluator

ImagePrompt Response Writing SFT
Designed and refined prompts for image generation models including Midjourney, GPT Image, and Nano Banana. Crafted detailed prompt variations and evaluated visual output consistency, clarity, and subject accuracy for AI art/model behaviors. Provided structured feedback and engaged in iterative improvement cycles to enhance model response to prompt structures. • Developed image and image-reference prompts for diverse artistic and representational tasks. • Assessed model behavior relating to creativity, style coherence, and subject representation. • Collaborated on prompt strategies for iterative visual output optimization. • Participated in output documentation and best practice sharing for prompt engineering.

Designed and refined prompts for image generation models including Midjourney, GPT Image, and Nano Banana. Crafted detailed prompt variations and evaluated visual output consistency, clarity, and subject accuracy for AI art/model behaviors. Provided structured feedback and engaged in iterative improvement cycles to enhance model response to prompt structures. • Developed image and image-reference prompts for diverse artistic and representational tasks. • Assessed model behavior relating to creativity, style coherence, and subject representation. • Collaborated on prompt strategies for iterative visual output optimization. • Participated in output documentation and best practice sharing for prompt engineering.

2023 - Present

LLM Output Evaluator and Prompt Engineer

Text
Evaluated and rated outputs of Large Language Models (LLMs) such as ChatGPT, Claude, Grok, and others across various prompt and response scenarios. Worked to detect hallucinations, inconsistencies, and biases in AI-generated text using structured evaluation guidelines. Refined and assessed prompts to improve instruction-following, clarity, and output quality, and participated in model comparison tasks for research workflows. • Compared outputs and behaviors of multiple LLMs across diverse benchmarks. • Conducted detailed quality assessments, context-window analyses, and bias identification protocols. • Used structured evaluation rubrics and prompt engineering to enhance model reliability. • Documented findings and provided feedback to guide LLM fine-tuning and improvements.

Evaluated and rated outputs of Large Language Models (LLMs) such as ChatGPT, Claude, Grok, and others across various prompt and response scenarios. Worked to detect hallucinations, inconsistencies, and biases in AI-generated text using structured evaluation guidelines. Refined and assessed prompts to improve instruction-following, clarity, and output quality, and participated in model comparison tasks for research workflows. • Compared outputs and behaviors of multiple LLMs across diverse benchmarks. • Conducted detailed quality assessments, context-window analyses, and bias identification protocols. • Used structured evaluation rubrics and prompt engineering to enhance model reliability. • Documented findings and provided feedback to guide LLM fine-tuning and improvements.

2022 - Present

Education

I

ISC2

Certificate in Cybersecurity, Cybersecurity

Certificate in Cybersecurity
2023 - 2023

Work History

L

Linden Birch Inc.

Founder & Operations Lead

Houston
2012 - Present
G

Guardianboost.com

Operations & Quality Lead

Houston
2017 - 2023