Datasets: bond005/sberdevices_golos_10h_crowd
Languages: ru
Multilinguality: monolingual
Size: 10K<n<100K
Annotations creators: expert-generated
Source datasets: extended
ArXiv: arxiv:2106.10161
License: other

Sberdevices Golos is a corpus of approximately 1200 hours of 16 kHz Russian speech from the crowd (read speech) and farfield (communication with smart devices) domains, prepared by the SberDevices Team (Alexander Denisenko, Angelina Kovalenko, Fedor Minkin, and Nikolay Karpov). The data is derived from a crowd-sourcing platform and has been manually annotated.
The authors divide the dataset into training and test subsets. The training subset contains approximately 1000 hours. For experiments with a limited amount of data, the authors also define shorter training subsets of 100 hours, 10 hours, 1 hour, and 10 minutes.
This dataset is a simpler version of the Golos corpus described above.
The audio is in Russian.
A typical data point comprises the audio data, called `audio`, and its transcription, called `transcription`. No additional information about the speaker or about the passage that contains the transcription is provided.
{'audio': {'path': None, 'array': array([ 3.05175781e-05, 3.05175781e-05, 0.00000000e+00, ..., -1.09863281e-03, -7.93457031e-04, -1.52587891e-04], dtype=float64), 'sampling_rate': 16000}, 'transcription': 'шестнадцатая часть сезона пять сериала лемони сникет тридцать три несчастья'}
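The record layout above is enough to work out how long a recording is: divide the number of samples in `audio['array']` by `audio['sampling_rate']`. The following is a minimal sketch of that calculation. The helper name `clip_duration_seconds` and the synthetic `example` record are assumptions made for illustration and are not part of the dataset; a real record would typically come from loading the dataset with the Hugging Face `datasets` library.

```python
def clip_duration_seconds(sample):
    """Return the length of one data point's audio in seconds.

    Assumes the record layout shown above: sample["audio"] holds the
    waveform under "array" and its rate in Hz under "sampling_rate".
    """
    audio = sample["audio"]
    return len(audio["array"]) / audio["sampling_rate"]


# Synthetic stand-in for a real record (hypothetical data):
# 32000 samples of silence at 16 kHz, i.e. two seconds of audio.
example = {
    "audio": {"path": None, "array": [0.0] * 32000, "sampling_rate": 16000},
    "transcription": "пример",
}

print(clip_duration_seconds(example))  # 2.0
```

Because the function only relies on `len()`, it works the same way on a plain list and on the NumPy array that a decoded record contains.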
This dataset is a simpler version of the original Golos, with the following splits:
| | Train | Validation | Test |
|---|---|---|---|
| examples | 7993 | 793 | 9994 |
| hours | 8.9h | 0.9h | 11.2h |
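As a quick sanity check on the table, the average recording length per split can be derived from its two rows. The sketch below is illustrative only: `SPLITS` and `mean_clip_seconds` are names introduced here, and because the hour totals are rounded to one decimal place the results are approximate. Each split works out to roughly four seconds per example.

```python
# Split statistics copied from the table above: (number of examples, total hours).
SPLITS = {
    "train": (7993, 8.9),
    "validation": (793, 0.9),
    "test": (9994, 11.2),
}


def mean_clip_seconds(num_examples, total_hours):
    """Average recording length in seconds for one split."""
    return total_hours * 3600 / num_examples


for name, (num_examples, total_hours) in SPLITS.items():
    print(f"{name}: {mean_clip_seconds(num_examples, total_hours):.2f} s per example")
```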
Who are the source language producers?
[Needs More Information]
All recorded audio files were manually annotated on the crowd-sourcing platform.
Who are the annotators?
[Needs More Information]
The dataset consists of recordings from people who have donated their voice. You agree to not attempt to determine the identity of speakers in this dataset.
The dataset was initially created by Alexander Denisenko, Angelina Kovalenko, Fedor Minkin, and Nikolay Karpov.
Public license with attribution and conditions reserved
@misc{karpov2021golos,
  author = {Karpov, Nikolay and Denisenko, Alexander and Minkin, Fedor},
  title = {Golos: Russian Dataset for Speech Research},
  publisher = {arXiv},
  year = {2021},
  url = {https://arxiv.org/abs/2106.10161}
}
Thanks to @bond005 for adding this dataset.