数据集:
scaredmeow/shopee-reviews-tl-binary
This dataset card aims to be a base template for new datasets. It has been generated using this raw template .
[More Information Needed]
[More Information Needed]
A typical data point, comprises of a text and the corresponding label.
An example from the YelpReviewFull test set looks as follows:
{ 'label': pos, 'text': 'Huyyy ang gandaaaaaaaaaaa. Grabe sobrang ganda talaga wala ako masabi. Complete orders pa pinadala sa akin. Buti hindi nabasag kahit walang bubble wrap. Okay na lang din para save mother earth and at least hindi nabasag hehe. Oorder ulit ako ang ganda eh' }
The Shopee reviews tl binary dataset is constructed by randomly taking 14000 training samples and 3000 samples for testing and validation for each review star from neg and pos. In total there are 28000 training samples and 6000 each in validation and testing samples.
[More Information Needed]
[More Information Needed]
Who are the source language producers?[More Information Needed]
[More Information Needed]
Who are the annotators?[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]