数据集:
argilla/comparison-data-falcon-with-feedback
大小:
1K<n<10KThis dataset has been created with Argilla .
As shown in the sections below, this dataset can be loaded into Argilla as explained in Load with Argilla , or used directly with the datasets library in Load with datasets .
This dataset contains:
A dataset configuration file conforming to the Argilla dataset format named argilla.cfg . This configuration file will be used to configure the dataset when using the FeedbackDataset.from_huggingface method in Argilla.
Dataset records in a format compatible with HuggingFace datasets . These records will be loaded automatically when using FeedbackDataset.from_huggingface and can be loaded independently using the datasets library via load_dataset .
The annotation guidelines that have been used for building and curating the dataset, if they've been defined in Argilla.
To load with Argilla, you'll just need to install Argilla as pip install argilla --upgrade and then use the following code:
import argilla as rg ds = rg.FeedbackDataset.from_huggingface("argilla/comparison-data-falcon-with-feedback")
To load this dataset with datasets , you'll just need to install datasets as pip install datasets --upgrade and then use the following code:
from datasets import load_dataset ds = load_dataset("argilla/comparison-data-falcon-with-feedback")
This dataset can contain multiple fields, questions and responses so it can be used for different NLP tasks, depending on the configuration. The dataset structure is described in the Dataset Structure section .
There are no leaderboards associated with this dataset.
[More Information Needed]
The dataset is created in Argilla with: fields , questions , and guidelines .
The fields are the dataset records themselves, for the moment just text fields are suppported. These are the ones that will be used to provide responses to the questions.
Field Name | Title | Type | Required | Markdown |
---|---|---|---|---|
instruction | User instruction | TextField | True | False |
response-1 | Response-1 | TextField | True | False |
response-2 | Response-2 | TextField | True | False |
The questions are the questions that will be asked to the annotators. They can be of different types, such as rating, text, single choice, or multiple choice.
Question Name | Title | Type | Required | Description | Values/Labels |
---|---|---|---|---|---|
choose-best | Choose the best response: | RatingQuestion | True | Choose the most helpful, harmless, and truthful response. Select 1 for response-1, 2 for response-2, or discard if both are equally good/bad. | [1, 2] |
Finally, the guidelines are just a plain string that can be used to provide instructions to the annotators. Find those in the annotation guidelines section.
An example of a dataset instance in Argilla looks as follows:
{ "external_id": "518", "fields": { "instruction": "What is Depreciation", "response-1": "Depreciation is the drop in value of an asset due to wear and tear, age and obsolescence (going out of date) as recorded in an organization\u0027s financial records.", "response-2": "What is Depreciation \u2013 10 Important Facts to Know?\nWhen a business buys a new asset, the purchase price of that asset is depreciated over time to reflect its usage and eventual obsolescence. Depreciation expense can be a tax deductible expense and is usually a non-cash expense reported on a company\u2019s income statement and balance sheet. The amount of depreciation expense a company reports each year is the difference between the original purchase price of the asset and what the current value of that asset might be. Here are 10 important facts to know about depreciation:\n1. Depreciation is a non-cash expense. It is an expense that is reported in a business\u2019s income statement and balance sheet and not a cash flow expense.\n2. Depreciation is an accounting standard and it is required to be disclosed in a business\u2019s financial statements.\n3. The amount of depreciation is usually a tax expense and not a cash expense reported on a company\u2019s income statement" }, "responses": [ { "status": "submitted", "user_id": null, "values": { "choose-best": { "value": 1 } } } ] }
While the same record in HuggingFace datasets looks as follows:
{ "choose-best": { "status": [ "submitted" ], "user_id": [ null ], "value": [ 1 ] }, "external_id": "518", "instruction": "What is Depreciation", "response-1": "Depreciation is the drop in value of an asset due to wear and tear, age and obsolescence (going out of date) as recorded in an organization\u0027s financial records.", "response-2": "What is Depreciation \u2013 10 Important Facts to Know?\nWhen a business buys a new asset, the purchase price of that asset is depreciated over time to reflect its usage and eventual obsolescence. Depreciation expense can be a tax deductible expense and is usually a non-cash expense reported on a company\u2019s income statement and balance sheet. The amount of depreciation expense a company reports each year is the difference between the original purchase price of the asset and what the current value of that asset might be. Here are 10 important facts to know about depreciation:\n1. Depreciation is a non-cash expense. It is an expense that is reported in a business\u2019s income statement and balance sheet and not a cash flow expense.\n2. Depreciation is an accounting standard and it is required to be disclosed in a business\u2019s financial statements.\n3. The amount of depreciation is usually a tax expense and not a cash expense reported on a company\u2019s income statement" }
Among the dataset fields, we differentiate between the following:
Fields: These are the dataset records themselves, for the moment just text fields are suppported. These are the ones that will be used to provide responses to the questions.
Questions: These are the questions that will be asked to the annotators. They can be of different types, such as rating, text, single choice, or multiple choice.
Additionally, we also have one more field which is optional and is the following:
The dataset contains a single split, which is train .
[More Information Needed]
[More Information Needed]
Who are the source language producers?[More Information Needed]
These guidelines are based on the paper Training Language Models to Follow Instructions with Human Feedback
You are given a text-based description of a task, submitted by a user.
This task description may be in the form of an explicit instruction (e.g. "Write a story about a wise frog."). The task may also be specified indirectly, for example by using several examples of the desired behavior (e.g. given a sequence of movie reviews followed by their sentiment, followed by one movie review without a sentiment, you can assume the task is to predict the sentiment of the final review), or by generating the start of a desired output (e.g. given "There once was a wise frog named Julius," you can assume the task is to continue the story).
You will also be given several text outputs, intended to help the user with their task. Your job is to evaluate these outputs to ensure that they are helpful, truthful, and harmless, and select the best according to these criteria.
In case none of the outputs is correct, conforming to the criteria, or they're exactly equal, use the discard option to not rank them .
For most tasks, being truthful and harmless is more important than being helpful.
By helpful , we mean that the output should follow the user's intention and help the user solve their task. Some examples of helpful behavior:
By truthful , we mean that the output contains accurate information and doesn't mislead the user. Some examples of truthful behavior:
By harmless , we mean that the output should not cause physical, psychological, or social harm to people; damage to or loss of equipment or property; damage to the environment; or harm to institutions or resources necessary to human well-being. Some examples of harmless behavior:
Evaluating model outputs may involve making trade-offs between these criteria. These trade-offs will depend on the task.
Annotation process[More Information Needed]
Who are the annotators?[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]