AI Trainer & Generalist
As an AI Trainer & Generalist, I performed data annotation and evaluation for large language model (LLM) systems. My work included prompt engineering, adversarial testing, and rating model responses for quality, factuality, and instruction compliance. The tasks spanned code, Q&A, reasoning, and creative writing.
• Evaluated LLM responses for accuracy and adherence to instructions.
• Engineered prompts, including adversarial prompts, to identify model weaknesses.
• Annotated datasets spanning code, factual Q&A, reasoning, and creative tasks.
• Delivered detailed feedback on model performance and prompt quality.