You can find the main data card on the GEM Website .
This is a large task-oriented dialog dataset in which a model has to produce the response. The input contains the context and a structured representation of what the model is supposed to generate. The input is already pre-formatted as string, turning this into a pure text-to-text problem.
You can load the dataset via:
import datasets data = datasets.load_dataset('GEM/Taskmaster')
The data loader can be found here .
website paper authorsGoogle researchers
@article{byrne2020tickettalk, title={TicketTalk: Toward human-level performance with end-to-end, transaction-based dialog systems}, author={Byrne, Bill and Krishnamoorthi, Karthik and Ganesh, Saravanan and Kale, Mihir Sanjay}, journal={arXiv preprint arXiv:2012.12458}, year={2020} }Contact Name
Karthik Krishnamoorthi
Contact Emailkrishnamoorthi@google.com
Has a Leaderboard?no
no
Covered DialectsNA
Covered LanguagesEnglish
Whose Language?NA
Licensecc-by-4.0: Creative Commons Attribution 4.0 International
Intended UseDialogues
Primary TaskDialog Response Generation
Communicative Goala movie ticketing dialog dataset with 23,789 annotated conversations.
other
Curation Organization(s)NA
Dataset CreatorsGoogle researchers
FundingTosin Adewumi (Luleå University of Technology)
NA
How were labels chosen?NA
Example Instance{'context': "<PR>get_movie_attribute<PRAN>rating.movie<PRAV>rated R<C><U>I wanna see a movie<A>where are you?<U>spring hills kansas<PN>find_theaters<PAN>location<PAV>spring hills kansas<PR>find_theaters<PRAN>name.theater<PRAV>AMC Holiday Theater<PRAV>Cinemark Downtown<A>there are 2 theaters near you, the AMC Holiday Theater and Cinemark Downtown. Did you know which movie you'd like to see?<U>funny one please<PN>find_movies<PAN>location<PAV>spring hills kansas<PR>find_movies<PRAN>name.movie<PRAV>Not My Problem<PRAV>Family Jewels<PN>get_movie_attribute<PAN>name.movie<PAV>Not My Problem<PAN>attribute<PAV>name.genre<PR>get_movie_attribute<PRAN>name.genre<PRAV>comedy<PN>get_movie_attribute<PAN>name.movie<PAV>Not My Problem<PAN>attribute<PAV>name.person<PR>get_movie_attribute<PRAN>name.person<PRAV>Matt Damon<PN>get_movie_attribute<PAN>name.movie<PAV>Not My Problem<PAN>attribute<PAV>name.person<PR>get_movie_attribute<PRAN>name.person<PRAV>Noah Schnapp<PN>get_movie_attribute<PAN>name.movie<PAV>Family Jewels<PAN>attribute<PAV>name.genre<PR>get_movie_attribute<PRAN>name.genre<PRAV>romantic comedy<PN>get_movie_attribute<PAN>name.movie<PAV>Family Jewels<PAN>attribute<PAV>name.person<PR>get_movie_attribute<PRAN>name.person<PRAV>Melissa McCarthy<PN>get_movie_attribute<PAN>name.movie<PAV>Family Jewels<PAN>attribute<PAV>name.person<PR>get_movie_attribute<PRAN>name.person<PRAV>Ryan Reynolds<A>There's the comedy film called Not My Problem starring Matt Damon and Noah Schnapp. There's also a romantic comedy called Family Jewels starring Melissa McCarthy and Ryan Reynolds.<U>what ratings are there?<PN>get_movie_attribute<PAN>name.movie<PAV>Not My Problem<PAN>attribute<PAV>rating.movie<PR>get_movie_attribute<PRAN>rating.movie<PRAV>rated PG-13<PN>get_movie_attribute<PAN>name.movie<PAV>Family Jewels<PAN>attribute<PAV>rating.movie", 'conversation_id': 'dlg-d1f52e7e-c34c-4e85-b406-85ed138b5068', 'gem_id': 'Taskmaster-train-0', 'references': ['Not My Problem is rated PG-13 and Family Jewels is rated R.'], 'target': 'Not My Problem is rated PG-13 and Family Jewels is rated R.'}Data Splits
- train : 187182 examples - dev : 23406 examples - test : 23316 examples
Splitting CriteriaNA
NA
Dialogue generation that makes sense
Similar Datasetsyes
Unique Language Coverageno
Difference from other GEM datasetsNA
Ability that the Dataset measuresNA
yes
GEM Modificationsother
Modification Detailsgem_id field was added to the 3 data splits
Additional Splits?no
https://github.com/google-research-datasets/Taskmaster/tree/master/TM-3-2020
Technical TermsNA
BLEU: 60
MetricsBLEU
Proposed Evaluationautomatic evaluation
Previous results available?yes
Other Evaluation ApproachesNA
Relevant Previous ResultsNA
NA
Communicative Goala movie ticketing dialog dataset with 23,789 annotated conversations.
Sourced from Different Sourcesno
Crowdsourced
Where was it crowdsourced?Participatory experiment
Language ProducersNA
Topics CoveredTicketing
Data Validationnot validated
Was Data Filtered?not filtered
none
Annotation Service?no
no
Justification for Using the DataNA
no PII
Justification for no PIIIt's based on ticketing without personal information
no
no
no
unsure
Are the Language Producers Representative of the Language?NA
NA
open license - commercial use allowed
Copyright Restrictions on the Language Datapublic domain
NA
Unsuited ApplicationsNA
Discouraged Use CasesNA