Skip to content
This repository has been archived by the owner on Jul 18, 2024. It is now read-only.

IBM/sciqa-arcade198-dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

AI2 Reasoning Challenge Annotated Dataset (ARCADE198)

This is the human-annotated AI2 Reasoning Challenge (ARC) dataset (ARCADE198) from the following paper:

A Systematic Classification of Knowledge, Reasoning, and Context within the ARC Dataset 
Boratko, M.; Padigela, H.; Mikkilineni, D.; Yuvraj, P.; Das, R.; McCallum, A.; Chang, M.; Fokoue, A.; Kapanipathi, P.; Mattei, N.; Musa, R.; Talamadupula, K.; and Witbrock, M.
ACL 2018 Machine Reading for Question Answering (MRQA) Workshop

The ARCADE198 dataset was generated using the annotation system from:

An Interface for Annotating Science Questions 
Boratko, M.; Padigela, H.; Mikkilineni, D.; Yuvraj, P.; Das, R.; McCallum, A.; Chang, M.; Fokoue, A.; Kapanipathi, P.; Mattei, N.; Musa, R.; Talamadupula, K.; and Witbrock, M.
EMNLP 2018 System Demonstration Program.

Use of the ARCADE198 Dataset

To use this dataset, please:

  • Cite the two papers above, using the following bib-entries:
@inproceedings{BoPaMiYu18,
Author = {M. Boratko and H. Padigela and D. Mikkilineni and P. Yuvraj and R. Das and A. McCallum and M. Chang and A. Fokoue-Nkoutche and P. Kapanipathi and N. Mattei and R. Musa and K. Talamadupula and M. Witbrock},
Booktitle = {{Proceedings of the Machine Reading for Question Answering (MRQA) Workshop at ACL 2018}},
Date-Added = {2018-06-06 19:16:13 +0000},
Date-Modified = {2018-06-06 19:18:45 +0000},
Title = {A Systematic Classification of Knowledge, Reasoning, and Context within the ARC Dataset},
Year = {2018}}
@inproceedings{BoPaMiYu18-2,
	Author = {M. Boratko and H. Padigela and D. Mikkilineni and P. Yuvraj and R. Das and A. McCallum and M. Chang and A. Fokoue-Nkoutche and P. Kapanipathi and N. Mattei and R. Musa and K. Talamadupula and M. Witbrock},
	Booktitle = {{Proceedings of the Empirical Methods in Natural Language Processing (EMNLP) 2018 System Demonstration Program}},
	Title = {An Interface for Annotating Science Questions},
	Year = {2018}}

Link to Dataset

Please download here: ARCADE198 Dataset

Blogpost

Here is a blogpost that describes the dataset, and talks a bit more about the associated work.