数据集:
argilla/databricks-dolly-15k-curated-en
In this dataset, you will find a collection of records that show a category, an instruction, a context and a response to that instruction. The aim of the project is to correct the instructions, intput and responses to make sure they are of the highest quality and that they match the task category that they belong to. All three texts should be clear and include real information. In addition, the response should be as complete but concise as possible.
To curate the dataset, you will need to provide an answer to the following text fields:
1 - Final instruction: The final version of the instruction field. You may copy it using the copy icon in the instruction field. Leave it as it is if it's ok or apply any necessary corrections. Remember to change the instruction if it doesn't represent well the task category of the record.
2 - Final context: The final version of the instruction field. You may copy it using the copy icon in the context field. Leave it as it is if it's ok or apply any necessary corrections. If the task category and instruction don't need of an context to be completed, leave this question blank.
3 - Final response: The final version of the response field. You may copy it using the copy icon in the response field. Leave it as it is if it's ok or apply any necessary corrections. Check that the response makes sense given all the fields above.
You will need to provide at least an instruction and a response for all records. If you are not sure about a record and you prefer not to provide a response, click Discard.
To load this dataset with Argilla, you'll just need to install Argilla as pip install argilla --upgrade and then use the following code:
import argilla as rg ds = rg.FeedbackDataset.from_huggingface('argilla/databricks-dolly-15k-curated-en')
To load this dataset with Datasets, you'll just need to install Datasets as pip install datasets --upgrade and then use the following code:
from datasets import load_dataset ds = load_dataset('argilla/databricks-dolly-15k-curated-en')