数据集:
times_of_india_news_headlines
许可:
cc0-1.0源数据集:
original批注创建人:
no-annotation语言创建人:
expert-generated大小:
1M<n<10M计算机处理:
monolingual语言:
enThis news dataset is a persistent historical archive of noteable events in the Indian subcontinent from start-2001 to mid-2020, recorded in realtime by the journalists of India. It contains approximately 3.3 million events published by Times of India. Times Group as a news agency, reaches out a very wide audience across Asia and drawfs every other agency in the quantity of english articles published per day. Due to the heavy daily volume over multiple years, this data offers a deep insight into Indian society, its priorities, events, issues and talking points and how they have unfolded over time. It is possible to chop this dataset into a smaller piece for a more focused analysis, based on one or more facets.
[More Information Needed]
The text in the dataset is in English.
{ 'publish_date': '20010530', 'headline_category': city.kolkata, 'headline_text': "Malda fake notes" }
This dataset has no splits.
[More Information Needed]
[More Information Needed]
Who are the source language producers?[More Information Needed]
[More Information Needed]
Who are the annotators?[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
The dataset was created by Rohit Kulkarni.
The data is under the CC0: Public Domain
@data{DVN/DPQMQH_2020, author = {Kulkarni, Rohit}, publisher = {Harvard Dataverse}, title = {{Times of India News Headlines}}, year = {2020}, version = {V1}, doi = {10.7910/DVN/DPQMQH}, url = {https://doi.org/10.7910/DVN/DPQMQH} }
Thanks to @tanmoyio for adding this dataset.