Dataset:

KTH/hungarian-single-speaker-tts

Tasks:

Text-to-Speech

task_categories:other

Language:

Hungarian

Multilinguality:

monolingual

Size:

1K<n<10K

Annotations creators:

expert-generated

Source datasets:

original

ArXiv:

arxiv:1903.11269

License:

cc0-1.0

Dataset Card for CSS10 Hungarian: Single Speaker Speech Dataset

Dataset Summary

The corpus contains 4515 audio segments from a single speaker, all extracted from one LibriVox audiobook.

Supported Tasks and Leaderboards

[Needs More Information]

Languages

The audio is in Hungarian.

Dataset Structure

[Needs More Information]
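Until the structure sections of this card are filled in, one way to find out what the data looks like is to load it and list its splits and columns. The following is a minimal sketch, assuming the repository can be read with `load_dataset` from the Hugging Face `datasets` library; the helper names `load_css10_hungarian` and `describe` are hypothetical and not part of the dataset or the library.

```python
# Repository name as given at the top of this card.
DATASET_ID = "KTH/hungarian-single-speaker-tts"


def load_css10_hungarian():
    """Download the dataset from the Hugging Face Hub (needs network access).

    Assumption: the repository loads with the default `load_dataset` call.
    The import is local so this file can be imported without `datasets` installed.
    """
    from datasets import load_dataset  # third-party: pip install datasets

    # No split is requested because this card does not document the splits;
    # the call then returns a DatasetDict keyed by whatever splits exist.
    return load_dataset(DATASET_ID)


def describe(dataset_dict):
    """Return {split_name: (num_rows, column_names)} for a DatasetDict-like mapping."""
    return {
        name: (len(split), list(split.column_names))
        for name, split in dataset_dict.items()
    }
```

Calling `describe(load_css10_hungarian())` would report the row count and column names of each split, which is the information the Data Fields and Data Splits sections below still lack.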

Data Instances

[Needs More Information]

Data Fields

[Needs More Information]

Data Splits

[Needs More Information]

Dataset Creation

Curation Rationale

CSS10 is a collection of single speaker speech datasets for 10 languages. Each of them consists of audio files recorded by a single volunteer and their aligned text sourced from LibriVox.

Source Data

Initial Data Collection and Normalization

The audio comes from the LibriVox audiobook Egri csillagok, read by Diana Majlinger.

Who are the source language producers?

[Needs More Information]

Annotations

Annotation process

[Needs More Information]

Who are the annotators?

[Needs More Information]

Personal and Sensitive Information

[Needs More Information]

Considerations for Using the Data

Social Impact of Dataset

[Needs More Information]

Discussion of Biases

[Needs More Information]

Other Known Limitations

[Needs More Information]

Additional Information

Dataset Curators

Kyubyong Park & Tommy Mulc

Licensing Information

CC0: Public Domain

Citation Information

@article{park2019css10,
  title={CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages},
  author={Park, Kyubyong and Mulc, Thomas},
  journal={Interspeech},
  year={2019}
}

Contributions

[Needs More Information]

Author:

KTH

Dataset size:

2.3 GB