Evaluation
I review pre-written prompts along with two corresponding responses, evaluate each response, provide feedback that reflects human reasoning, and determine which one is superior.
Hire this AI Trainer
Sign in or create an account to invite AI Trainers to your job.
No subject matter listed
My AI training skills cover a wide range of areas, including RLHF, evaluation and rating of two-model responses, audio recording, annotation, transcription, and SFT. My strongest skills are in Evaluation and RLHF. I specialize in designing effective prompt engineering and creating rubrics that optimize model learning.
I review pre-written prompts along with two corresponding responses, evaluate each response, provide feedback that reflects human reasoning, and determine which one is superior.
I recorded short audio clips of about 20 seconds according to assigned categories, transcribed the recordings, classified them based on their purpose (such as whether they were knowledge-seeking or casual conversation), and evaluated their noise levels.
In a project aimed at preventing the model from generating harmful outputs, I created prompts designed to elicit such harmful responses, evaluated the model’s outputs, and wrote ideal responses that demonstrated safe and appropriate behavior.
I have created prompts across a wide range of categories such as OpenQA, Summarization, and Creative Writing, covering diverse fields including social sciences, physics, and travel, and evaluated responses from two different models for each.
expert, make music, sound effect, voice recording, play instruments
Bachelor of Science, Physics, Earth Science
Sales Representative