RLHF prompt writer, response evaluator, and rewriter
In this RLHF LLM training project, I am responsible for writing prompts in a specific category (business, health, natural sciences, etc.) with enough natural constraints that at least one of the two generated responses contains a major or minor issue across the rated dimensions. The dimensions are localization (in this case, Spain), instruction following, truthfulness, verbosity, writing quality, and harmlessness, and each must be labelled as major issue, minor issue, or no issue. After rating both responses across these dimensions, I choose which of the two is better according to their individual ratings. If both receive the same rating, I rewrite the one I believe is better, judging by the weight of the issues in each.
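As a rough illustration only, the comparison step could be sketched in Python as below. The numeric weights (0, 1, 2), the summing rule, and the names `Severity`, `DIMENSIONS`, `total_severity`, and `compare` are assumptions made for this sketch; the project's actual scoring rule is not specified here.

```python
from enum import IntEnum


class Severity(IntEnum):
    # Hypothetical numeric weights; the real project may weigh issues differently.
    NO_ISSUE = 0
    MINOR = 1
    MAJOR = 2


# The six dimensions named in the description above.
DIMENSIONS = (
    "localization",
    "instruction_following",
    "truthfulness",
    "verbosity",
    "writing_quality",
    "harmlessness",
)


def total_severity(ratings: dict) -> int:
    """Sum the severity labels of one response across all six dimensions."""
    missing = set(DIMENSIONS) - set(ratings)
    if missing:
        raise ValueError(f"missing ratings for: {sorted(missing)}")
    return sum(int(ratings[d]) for d in DIMENSIONS)


def compare(ratings_a: dict, ratings_b: dict) -> str:
    """Return 'A' or 'B' for the response with the lower total, or 'tie'.

    Assumption: a lower summed severity means a better response.
    """
    score_a = total_severity(ratings_a)
    score_b = total_severity(ratings_b)
    if score_a < score_b:
        return "A"
    if score_b < score_a:
        return "B"
    return "tie"


if __name__ == "__main__":
    clean = {d: Severity.NO_ISSUE for d in DIMENSIONS}
    response_a = {**clean, "verbosity": Severity.MINOR}
    response_b = {**clean, "truthfulness": Severity.MAJOR}
    print(compare(response_a, response_b))
```

A `"tie"` result corresponds to the case where both responses achieve the same rating; at that point the choice of which one to rewrite is a human judgment about the weight of each issue, which this sketch deliberately does not automate.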