AI Training & Data Quality Specialist
As an AI Training & Data Quality Specialist at DataAnnotation.tech, I conducted reinforcement learning from human feedback (RLHF) for conversational and image-generation AI models. My responsibilities included evaluating multimodal AI outputs and authoring high-quality training datasets. I ensured adherence to complex style guides and technical standards to improve model behavior and logic improvements. • Performed detailed RLHF to align LLMs and text-to-image models. • Evaluated prompt adherence, factual accuracy, and visual/technical coherence of AI outputs. • Authored technical justifications and rubrics for dataset quality. • Ensured safety and mitigated bias in multimodal model outputs.