AI Response Evaluator / Data Annotator
I evaluate AI-generated text responses for quality and compliance with guidelines, and rank AI outputs for reinforcement learning from human feedback (RLHF) to improve model alignment. The work is ongoing and remote, performed on data labeling platforms.
• Evaluated AI and LLM outputs for accuracy, helpfulness, and safety compliance
• Ranked text responses and provided preference data for reinforcement learning pipelines
• Annotated text for categorization and quality filtering
• Used DataAnnotation.tech and OpenTrain AI to complete labeling assignments