RLHF data training in German
Scope of the Project: Reinforced Learning from human development in several languages including German, Austrian German, Swiss German for trainin of an LLM. No futher information was provided. Specific data labeling taks: evaluating 2 responses in the dimensions instruction following, truth, localization, writing & style, verbosity and harmfulnes on a scale from 1 to 3 and writing justifications. Overallscore from 1 to 5. Comparing the responses against each other and writing a justification for selecting the better response. Procet size (what is known): Several languages Englisch, German, Austrian German, Swiss German. above 100 contributors Quality measures: Training tasks and feedback once in a while.