LLM evaluation
Evaluate the performance of multiple models on complex prompts, some of which include contextual documentation. Responses are assessed across multiple dimensions.
I have worked as a freelance data annotator for over a year. I specialize in French, and more specifically in Swiss French. Projects on which I have delivered excellent results include RLHF, prompt writing, LLM evaluation, and audio data collection. Through this experience, I have learned how to write complex prompts that help improve the performance of AI models on complex and localized topics. In addition, I have gained strong skills in evaluating how well an AI model answers different types of text and image prompts.
Write localized prompts in Swiss French, designed to make the model fail on one or more dimensions such as localization, instruction following, truthfulness, and writing quality.
Master's, Biology
Senior Research Associate