AI Training & RLHF Tasks (Work Simulation)
I evaluated AI-generated responses for correctness, business tone, and professional standards. I performed rubric-based scoring, pairwise ranking, and executed RLHF tasks to align model outputs with human intent and quality. I created adversarial prompts and test cases to identify weaknesses and edge cases in LLM reasoning. • Assessed quality and bias of AI outputs with comprehensive evaluation. • Authored complex and challenging prompts for stress-testing models. • Delivered improvements and corrections based on evaluation findings. • Contributed to the optimization of model safety and alignment via RLHF.