Aether Project - AI Content Evaluation
Worked as an AI Content Evaluator for the Aether project via Outlier. Responsible for high-quality data labeling and Reinforcement Learning from Human Feedback (RLHF) to fine-tune Large Language Models (LLMs). Tasks included assessing model responses for factual accuracy, safety, and alignment with complex engineering and technical constraints.