Datasets:

NbAiLab/norwegian_parliament

Languages:

no

Multilinguality:

monolingual

Size categories:

1K<n<10K

Language creators:

found

Annotations creators:

expert-generated

Source datasets:

original

License:

cc-by-4.0

Dataset Card Creation Guide

Dataset Summary

The Norwegian Parliament Speeches dataset is a collection of text passages from speeches delivered at the Norwegian Parliament (Storting) between 1998 and 2016 by members of two major parties: Fremskrittspartiet and Sosialistisk Venstreparti. Each passage is annotated with the party the speaker was associated with at the time, and the date of the speech is also included.

Supported Tasks and Leaderboards

[More Information Needed]

Languages

The text in the dataset is in Norwegian.

Dataset Structure

Data Instances

Example of one instance in the dataset.

{'label': 0, 'text': 'Verre er det med slagsmålene .'}

Data Fields

  • id: Index of the example
  • text: Text of a speech
  • date: Date (YYYY-MM-DD) the speech was produced
  • label: Political party the speaker was associated with at the time
    • 0 = Fremskrittspartiet
    • 1 = Sosialistisk Venstreparti
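
The label encoding above can be turned back into party names with a small lookup. The following is a minimal sketch, not part of the dataset itself: the names `ID2LABEL` and `party_name` are hypothetical helpers introduced for illustration, and the row is the instance shown under Data Instances.

```python
# Label mapping copied from the Data Fields list above.
# ID2LABEL and party_name are illustrative names, not part of the dataset.
ID2LABEL = {0: "Fremskrittspartiet", 1: "Sosialistisk Venstreparti"}


def party_name(example: dict) -> str:
    """Return the party name for a dataset row (a dict with a 'label' key)."""
    return ID2LABEL[example["label"]]


# The instance shown under Data Instances.
example = {"label": 0, "text": "Verre er det med slagsmålene ."}
print(party_name(example))  # Fremskrittspartiet
```

Rows loaded through the Hugging Face `datasets` library (for example with `load_dataset("NbAiLab/norwegian_parliament")`) are plain dicts, so the same helper should apply to them, assuming the installed version of the library can still load this dataset.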

Data Splits

The dataset is divided into train, validation, and test splits with the following sizes:

                     Train  Valid  Test
Number of examples   3600   1200   1200

The dataset is balanced on political party.
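
As a quick sanity check, the split sizes above correspond to a 60/20/20 partition of 6,000 examples. The sketch below works this out and adds a hypothetical helper, `label_shares`, that could be used to verify the stated party balance on any list of labels; the toy label list is made up for illustration and is not drawn from the dataset.

```python
from collections import Counter

# Split sizes as reported in the table above.
SPLIT_SIZES = {"train": 3600, "validation": 1200, "test": 1200}

total = sum(SPLIT_SIZES.values())
fractions = {name: size / total for name, size in SPLIT_SIZES.items()}


def label_shares(labels):
    """Return the share of each label value in an iterable of labels."""
    counts = Counter(labels)
    n = sum(counts.values())
    return {label: count / n for label, count in counts.items()}


print(total)
print(fractions)

# Toy labels for illustration only; a balanced split should give 0.5 per party.
print(label_shares([0, 1, 1, 0]))
```

In practice, `label_shares` would be called on the `label` column of each split after loading the data.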

Dataset Creation

This dataset is based on information made publicly available by the Norwegian Parliament (Storting). It was created by the National Library of Norway AI-Lab to benchmark its language models.

Additional Information

Licensing Information

This work is licensed under a Creative Commons Attribution 4.0 International License.

Citation Information

@misc{--,
      title={--},
      author={--},
      year={2021},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}