Dataset Card for "fiqa"

Dataset Summary

This is a preprocessed version of FiQA, prepared so that it can be consumed easily via Hugging Face. The original dataset can be found here.
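As a minimal sketch of consuming the data through the Hugging Face `datasets` library: the repository id below is a placeholder assumption, since this card does not state it, and the helper name `load_train_split` is hypothetical. Substitute the actual id of this dataset before running.

```python
# Placeholder (assumption): this card does not state the repository id.
# Replace it with the real "<namespace>/<dataset-name>" before use.
DATASET_ID = "<namespace>/<dataset-name>"


def load_train_split(dataset_id: str = DATASET_ID):
    """Download (or load from the local cache) the 'train' split."""
    # Imported lazily so this module can be imported even when the
    # `datasets` package is not installed.
    from datasets import load_dataset

    return load_dataset(dataset_id, split="train")


if __name__ == "__main__":
    train = load_train_split()
    # Each record is expected to expose the two string fields of this card.
    print(train[0]["question"])
    print(train[0]["answer"])
```

Calling `load_train_split()` needs network access on first use; later calls read from the local cache.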

The growing maturity of Natural Language Processing (NLP) techniques and resources is drastically changing the landscape of many application domains that depend on the analysis of unstructured data at scale. The financial domain, with its dependency on the interpretation of multiple unstructured and structured data sources and its demand for fast and comprehensive decision making, is already emerging as a primary ground for experimentation with NLP, Web Mining and Information Retrieval (IR) techniques. This challenge focuses on advancing the state of the art in aspect-based sentiment analysis and opinion-based Question Answering for the financial domain.

Dataset Structure

Data Instances

An example of 'train' looks as follows.

{
    "question": "How does a 2 year treasury note work?",
    "answer": "Notes and Bonds sell at par (1.0). When rates go up, their value goes down. When rates go down, their value goes up. ..."
}

Data Fields

The data fields are the same among all splits.

question: a string feature.
answer: a string feature.
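The two-field schema above can be checked with a small helper. This is only an illustrative sketch: the function name `is_valid_record` and the constant `EXPECTED_FIELDS` are hypothetical, and the sample record reuses the question from the example instance with a shortened answer.

```python
from typing import Any, Mapping

# Field names taken from the "Data Fields" section of this card.
EXPECTED_FIELDS = ("question", "answer")


def is_valid_record(record: Mapping[str, Any]) -> bool:
    """Return True if the record has exactly the expected string fields."""
    has_exact_keys = set(record) == set(EXPECTED_FIELDS)
    # `and` short-circuits, so missing keys never raise a KeyError here.
    return has_exact_keys and all(
        isinstance(record[field], str) for field in EXPECTED_FIELDS
    )


example = {
    "question": "How does a 2 year treasury note work?",
    "answer": "Notes and Bonds sell at par (1.0). ...",
}

print(is_valid_record(example))  # prints True
```

A check like this is handy when merging the data with other question-answer sources, where extra or missing columns are a common cause of silent errors.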

Additional Information

Licensing Information

This dataset is distributed under the CC BY-NC licence providing free access for non-commercial and academic usage.

Author

LLukas22

Dataset Size

18.34 MB