数据集:
wongnai_reviews
任务:
文本分类语言:
th计算机处理:
monolingual大小:
10K<n<100K语言创建人:
found批注创建人:
found源数据集:
original许可:
lgpl-3.0The Wongnai Review dataset contains restaurant reviews and ratings, almost entirely in Thai language.
The reviews are in 5 classes ranging from 1 to 5 stars.
This dataset was featured in a Kaggle challenge https://www.kaggle.com/c/wongnai-challenge-review-rating-prediction/overview
Thai
Designated train (40,000 reviews) and test (6,204) sets.
Data was collected by Wongnai from business reviews on their website, and shared on GitHub and Kaggle.
The reviews are users' own star ratings, so no additional annotation was needed.
Contributors to original GitHub repo:
LGPL-3.0
See https://github.com/wongnai/wongnai-corpus
Thanks to @mapmeld , @cstorm125 for adding this dataset.