The Touché23-ValueEval Dataset comprises 9324 arguments from six different sources. An argument's source is indicated by the first letter of its Argument ID:
The annotated labels are based on the value taxonomy published in Identifying the Human Values behind Arguments (Kiesel et al. 2022) at ACL'22.
[1] https://language.ml
[2] https://en.wikipedia.org/wiki/Nahj_al-Balagha
[3] https://en.wikipedia.org/wiki/Ghurar_al-Hikam_wa_Durar_al-Kalim
The default configuration name is main.
from datasets import load_dataset

dataset = load_dataset("webis/Touche23-ValueEval")
print(dataset['train'].info.description)

for argument in iter(dataset['train']):
    print(f"{argument['Argument ID']}: {argument['Stance']} '{argument['Conclusion']}': {argument['Premise']}")
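Because the first letter of an Argument ID identifies the source, arguments can be grouped by source without any extra metadata. The following is a minimal sketch of that idea; the helper name `source_letter` and the sample IDs are hypothetical and only illustrate the ID format, they are not taken from the dataset.

```python
from collections import Counter


def source_letter(argument_id: str) -> str:
    """Return the first letter of an Argument ID, which indicates its source."""
    if not argument_id:
        raise ValueError("Argument ID must not be empty")
    return argument_id[0]


# Hypothetical IDs for illustration; real IDs come from argument['Argument ID'].
sample_ids = ["A01002", "A01005", "E02010", "C26001"]

# Count how many arguments each source letter contributes.
counts = Counter(source_letter(argument_id) for argument_id in sample_ids)
print(counts)
```

With a loaded split, the same counter could be fed from `argument['Argument ID']` for each instance.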
Human Value Detection
The Argument Instances are all monolingual; they only include English (mostly en-US) documents. The Metadata Instances for some dataset parts additionally state the arguments in their original language and phrasing.
Each argument instance has the following attributes:
Additionally, the labels are separated into value-categories, also known as level 2 labels of the value taxonomy (Kiesel et al. 2022b), and human values, also known as level 1 labels of the value taxonomy. This distinction is also reflected in the configuration names:
labels = ["Self-direction: thought", "Self-direction: action", "Stimulation", "Hedonism", "Achievement", "Power: dominance", "Power: resources", "Face", "Security: personal", "Security: societal", "Tradition", "Conformity: rules", "Conformity: interpersonal", "Humility", "Benevolence: caring", "Benevolence: dependability", "Universalism: concern", "Universalism: nature", "Universalism: tolerance", "Universalism: objectivity"]
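As a hedged illustration of how this list might be used: assuming each argument instance carries a binary vector whose positions align with the label order above (1 if the argument resorts to the value, 0 otherwise), the vector can be translated back into label names. The function name `decode_labels` and the example vector are hypothetical.

```python
# Label order copied from the list above.
LABELS = [
    "Self-direction: thought", "Self-direction: action", "Stimulation",
    "Hedonism", "Achievement", "Power: dominance", "Power: resources",
    "Face", "Security: personal", "Security: societal", "Tradition",
    "Conformity: rules", "Conformity: interpersonal", "Humility",
    "Benevolence: caring", "Benevolence: dependability",
    "Universalism: concern", "Universalism: nature",
    "Universalism: tolerance", "Universalism: objectivity",
]


def decode_labels(vector):
    """Return the names of all labels whose position in `vector` is set to 1."""
    if len(vector) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} entries, got {len(vector)}")
    return [name for name, flag in zip(LABELS, vector) if flag == 1]


# Hypothetical annotation vector (assumption: one 0/1 entry per label).
example_vector = [0] * len(LABELS)
example_vector[LABELS.index("Hedonism")] = 1
example_vector[LABELS.index("Universalism: nature")] = 1

print(decode_labels(example_vector))  # ['Hedonism', 'Universalism: nature']
```

The length check guards against mixing up vectors from configurations with a different number of labels.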
The configuration names (as replacements for <config>) in this dataset are:
dataset_main_train = load_dataset("webis/Touche23-ValueEval", split="train")
dataset_main_validation = load_dataset("webis/Touche23-ValueEval", split="validation")
dataset_main_test = load_dataset("webis/Touche23-ValueEval", split="test")
dataset_nahjalbalagha_test = load_dataset("webis/Touche23-ValueEval", name="nahjalbalagha", split="test")
dataset_nyt_test = load_dataset("webis/Touche23-ValueEval", name="nyt", split="test")
dataset_zhihu_validation = load_dataset("webis/Touche23-ValueEval", name="zhihu", split="validation")
Please note that, for copyright reasons, there is currently no direct download link to the arguments contained in the New York Times dataset. Accessing any of the nyt or nyt-level1 configurations will therefore use the specifically created nyt-downloader program to create and access the arguments locally. See the program's README for further details.
The following lists all configuration names for metadata. Each configuration has only a single split, named meta.
dataset_ibm_metadata = load_dataset("webis/Touche23-ValueEval", name="ibm-meta", split="meta")
dataset_zhihu_metadata = load_dataset("webis/Touche23-ValueEval", name="zhihu-meta", split="meta")
dataset_gdi_metadata = load_dataset("webis/Touche23-ValueEval", name="gdi-meta", split="meta")
dataset_cofe_metadata = load_dataset("webis/Touche23-ValueEval", name="cofe-meta", split="meta")
dataset_nahjalbalagha_metadata = load_dataset("webis/Touche23-ValueEval", name="nahjalbalagha-meta", split="meta")
dataset_nyt_metadata = load_dataset("webis/Touche23-ValueEval", name="nyt-meta", split="meta")
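The metadata configuration names above all follow the pattern `<part>-meta`, so they can also be loaded in a loop. This is a sketch under that assumption; `META_PARTS`, `meta_config_name`, and `load_all_metadata` are hypothetical helper names, and whether the nyt-meta configuration also involves the nyt-downloader is not stated here.

```python
# Dataset parts taken from the metadata configuration names listed above.
META_PARTS = ["ibm", "zhihu", "gdi", "cofe", "nahjalbalagha", "nyt"]


def meta_config_name(part: str) -> str:
    """Build the metadata configuration name for a dataset part."""
    return f"{part}-meta"


def load_all_metadata(parts=META_PARTS):
    """Load the single 'meta' split of every metadata configuration."""
    # Imported lazily so the name helper works without the datasets library.
    from datasets import load_dataset

    return {
        part: load_dataset(
            "webis/Touche23-ValueEval",
            name=meta_config_name(part),
            split="meta",
        )
        for part in parts
    }


print([meta_config_name(part) for part in META_PARTS])
```

Calling `load_all_metadata()` returns a dictionary keyed by dataset part and requires network access on first use.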
{
  "<value category>": {
    "<level 1 value>": [
      "<exemplary effect a corresponding argument might target>",
      ...
    ],
    ...
  },
  ...
}

As this configuration contains just a single entry, an example usage could be:
value_categories = load_dataset("webis/Touche23-ValueEval", name="value-categories", split="meta")[0]
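Given the nested structure described above, a common follow-up is to look up the value category for a level 1 value. The sketch below inverts the nesting into a flat mapping; the function name `values_to_categories` and the sample taxonomy are hypothetical placeholders, and it assumes the loaded entry behaves like a plain nested dictionary.

```python
def values_to_categories(taxonomy):
    """Map each level 1 value to the value category that contains it."""
    mapping = {}
    for category, values in taxonomy.items():
        for value in values:
            mapping[value] = category
    return mapping


# Hypothetical stand-in with the same shape as the value-categories entry.
sample_taxonomy = {
    "Category A": {
        "value a1": ["effect 1", "effect 2"],
        "value a2": ["effect 3"],
    },
    "Category B": {
        "value b1": ["effect 4"],
    },
}

lookup = values_to_categories(sample_taxonomy)
print(lookup["value a2"])  # Category A
```

With the real data, `values_to_categories(value_categories)` would be called on the entry loaded above.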
Creative Commons Attribution 4.0 International (CC BY 4.0)
@Article{mirzakhmedova:2023a,
  author =    {Nailia Mirzakhmedova and Johannes Kiesel and Milad Alshomary and Maximilian Heinrich and Nicolas Handke
               and Xiaoni Cai and Valentin Barriere and Doratossadat Dastgheib and Omid Ghahroodi and {Mohammad Ali} Sadraei
               and Ehsaneddin Asgari and Lea Kawaletz and Henning Wachsmuth and Benno Stein},
  doi =       {10.48550/arXiv.2301.13771},
  journal =   {CoRR},
  month =     jan,
  publisher = {arXiv},
  title =     {{The Touch{\'e}23-ValueEval Dataset for Identifying Human Values behind Arguments}},
  volume =    {abs/2301.13771},
  year =      2023
}