AI Data Trainer / Model Evaluator (Freelance, Remote)
As an AI Data Trainer and Model Evaluator, I evaluated AI-generated text responses for accuracy and safety. I ranked outputs to facilitate reinforcement learning from human feedback (RLHF) for large language models. I was responsible for identifying hallucinations, bias, and factual errors in LLM outputs. • Evaluated and rated text responses for relevance and truthfulness. • Ranked outputs to optimize model performance. • Detected bias and hallucination in model-generated content. • Developed prompts to test model reasoning abilities.