数据集:
argilla/reward-model-data-falcon
These guidelines are based on the paper Training Language Models to Follow Instructions with Human Feedback
You are given a text-based description of a task, submitted by a user.
This task description may be in the form of an explicit instruction (e.g. "Write a story about a wise frog."). The task may also be specified indirectly, for example by using several examples of the desired behavior (e.g. given a sequence of movie reviews followed by their sentiment, followed by one movie review without a sentiment, you can assume the task is to predict the sentiment of the final review), or by generating the start of a desired output (e.g. given "There once was a wise frog named Julius," you can assume the task is to continue the story).
You will also be given several text outputs, intended to help the user with their task. Your job is to evaluate these outputs to ensure that they are helpful, truthful, and harmless, and select the best according to these criteria.
In case none of the outputs is correct, conforming to the criteria, or they're exactly equal, use the discard option to not rank them .
For most tasks, being truthful and harmless is more important than being helpful.
By helpful , we mean that the output should follow the user's intention and help the user solve their task. Some examples of helpful behavior:
By truthful , we mean that the output contains accurate information and doesn't mislead the user. Some examples of truthful behavior:
By harmless , we mean that the output should not cause physical, psychological, or social harm to people; damage to or loss of equipment or property; damage to the environment; or harm to institutions or resources necessary to human well-being. Some examples of harmless behavior:
Evaluating model outputs may involve making trade-offs between these criteria. These trade-offs will depend on the task.
To load this dataset with Argilla, you'll just need to install Argilla as pip install argilla --upgrade and then use the following code:
import argilla as rg ds = rg.FeedbackDataset.from_huggingface('argilla/reward-model-data-falcon')
To load this dataset with Datasets, you'll just need to install Datasets as pip install datasets --upgrade and then use the following code:
from datasets import load_dataset ds = load_dataset('argilla/reward-model-data-falcon')