Freelance AI Evaluation & Software Testing Specialist
I performed comprehensive AI model evaluation and data labeling to train Large Language Models (LLMs). My responsibilities included reviewing and scoring AI-generated textual outputs, conducting reinforcement learning from human feedback (RLHF) tasks, and participating in prompt-based training and evaluation. I consistently delivered high-quality linguistic and analytical data to improve AI performance and accuracy.
• Evaluated AI text responses for appropriateness, relevance, and safety
• Labeled and scored search results and chatbot outputs across multiple platforms
• Participated in RLHF and prompt engineering projects
• Utilized leading annotation software and proprietary tools for data labeling tasks