AI Evaluator and Quality Specialist
Contributed to a project fine-tuning large language models (LLMs) across multiple linguistic and cultural contexts. Responsibilities included writing and evaluating prompts, providing detailed feedback on AI-generated responses, and ranking responses by preference to improve model performance. Focused on improving the models' ability to follow task-specific instructions and to produce accurate, coherent, and contextually relevant output, while complying with strict confidentiality requirements and project guidelines.