RLHF, Data Labeling, Model Evaluation, Red-Teaming (Core Skills Portion)
Worked on Reinforcement Learning from Human Feedback (RLHF) and AI data labeling as core competencies. Contributed to prompt engineering, data labeling, model evaluation, and adversarial testing using state-of-the-art approaches. Collaborated on large language model (LLM) evaluation and benchmarking.
• Applied prompt-response writing and red-teaming methodologies
• Labeled and rated AI-generated text outputs for quality and accuracy
• Conducted adversarial testing of LLMs for robustness
• Evaluated and benchmarked AI models for performance