数据集:
sst
任务:
文本分类语言:
en计算机处理:
monolingual语言创建人:
found批注创建人:
crowdsourced源数据集:
original许可:
license:unknownThe Stanford Sentiment Treebank is the first corpus with fully labeled parse trees that allows for a complete analysis of the compositional effects of sentiment in language.
The text in the dataset is in English
For the default configuration:
{'label': 0.7222200036048889, 'sentence': 'Yet the act is still charming here .', 'tokens': 'Yet|the|act|is|still|charming|here|.', 'tree': '15|13|13|10|9|9|11|12|10|11|12|14|14|15|0'}
For the dictionary configuration:
{'label': 0.7361099720001221, 'phrase': 'still charming'}
For the ptb configuration:
{'ptb_tree': '(3 (2 Yet) (3 (2 (2 the) (2 act)) (3 (4 (3 (2 is) (3 (2 still) (4 charming))) (2 here)) (2 .))))'}
The set of complete sentences (both default and ptb configurations) is split into a training, validation and test set. The dictionary configuration has only one split as it is used for reference rather than for learning.
[Needs More Information]
[Needs More Information]
Who are the source language producers?Rotten Tomatoes reviewers.
[Needs More Information]
Who are the annotators?[Needs More Information]
[Needs More Information]
[Needs More Information]
[Needs More Information]
[Needs More Information]
[Needs More Information]
[Needs More Information]
@inproceedings{socher-etal-2013-recursive, title = "Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank", author = "Socher, Richard and Perelygin, Alex and Wu, Jean and Chuang, Jason and Manning, Christopher D. and Ng, Andrew and Potts, Christopher", booktitle = "Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing", month = oct, year = "2013", address = "Seattle, Washington, USA", publisher = "Association for Computational Linguistics", url = "https://www.aclweb.org/anthology/D13-1170", pages = "1631--1642", }
Thanks to @patpizio for adding this dataset.