Datasets:

hausa_voa_topics

Languages:

ha

Multilinguality:

monolingual

Size categories:

1K<n<10K

Language creators:

found

Annotations creators:

expert-generated

Source datasets:

original

Dataset Card for Hausa VOA News Topic Classification dataset (hausa_voa_topics)

Dataset Summary

A news headline topic classification dataset for Hausa, similar to AG-news. The news headlines were collected from VOA Hausa.

Supported Tasks and Leaderboards

[More Information Needed]

Languages

Hausa (ISO 639-1: ha)

Dataset Structure

Data Instances

An instance consists of a news title sentence and the corresponding topic label.

Data Fields

  • news_title: A news title.
  • label: The label describing the topic of the news title. Can be one of the following classes: Nigeria, Africa, World, Health or Politics.
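
As a rough illustration of the two fields above, the following sketch models an instance and maps its topic label to an integer id. The `Instance` class, the `encode` helper, and the integer ordering of the classes are assumptions made for this example, not part of the dataset's published schema. The headline is a placeholder rather than real data.

```python
from dataclasses import dataclass

# Topic classes as listed in this card. The integer order is an assumption
# made for illustration; the released dataset may index them differently.
LABELS = ["Nigeria", "Africa", "World", "Health", "Politics"]
LABEL_TO_ID = {name: idx for idx, name in enumerate(LABELS)}


@dataclass(frozen=True)
class Instance:
    """Hypothetical container mirroring the two documented fields."""
    news_title: str  # the headline text
    label: str       # one of LABELS


def encode(instance: Instance) -> int:
    """Return the integer id of the instance's topic label."""
    if instance.label not in LABEL_TO_ID:
        raise ValueError(f"unknown label: {instance.label!r}")
    return LABEL_TO_ID[instance.label]


# Placeholder headline, not taken from the dataset.
example = Instance(news_title="<Hausa headline text>", label="Health")
print(encode(example))
```

Rejecting unknown labels explicitly, rather than defaulting to some class, helps surface mismatches early if the actual label set differs from the one listed here.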

Data Splits

[More Information Needed]

Dataset Creation

Curation Rationale

[More Information Needed]

Source Data

Initial Data Collection and Normalization

[More Information Needed]

Who are the source language producers?

[More Information Needed]

Annotations

Annotation process

[More Information Needed]

Who are the annotators?

[More Information Needed]

Personal and Sensitive Information

[More Information Needed]

Considerations for Using the Data

Social Impact of Dataset

[More Information Needed]

Discussion of Biases

[More Information Needed]

Other Known Limitations

[More Information Needed]

Additional Information

Dataset Curators

[More Information Needed]

Licensing Information

[More Information Needed]

Citation Information

[More Information Needed]

Contributions

Thanks to @michael-aloys for adding this dataset.