AI Trainer
As an AI Trainer for LLM projects, I collaborated on fine-tuning models to improve accuracy, tone, and task alignment. My work involved prompt engineering, response evaluation, and systematic assessment of model outputs. I performed RLHF tasks and contributed to model evaluation using structured rubrics. • Designed and performed reinforcement learning from human feedback by ranking and annotating model outputs. • Crafted complex prompts for diverse use cases including education, productivity, and customer support. • Built evaluation frameworks and rubrics to assess factual correctness and alignment with user intent. • Developed tools and contributed insights on LLM behavior correction and automated validation with Python.