The DARE Corpus: A Resource for Anaphora Resolution in Dialogue Based Intelligent Tutoring Systems

Nobal Niraula, Vasile Rus, Rajendra Banjade, Dan Stefanescu, William Baggett, Brent Morgan


Abstract
We describe the DARE corpus, an annotated data set focusing on pronoun resolution in tutorial dialogue. Although data sets for general purpose anaphora resolution exist, they are not suitable for dialogue based Intelligent Tutoring Systems. To the best of our knowledge, no data set is currently available for pronoun resolution in dialogue based intelligent tutoring systems. The described DARE corpus consists of 1,000 annotated pronoun instances collected from conversations between high-school students and the intelligent tutoring system DeepTutor. The data set is publicly available.
Anthology ID:
L14-1320
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3199–3203
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/372_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Nobal Niraula, Vasile Rus, Rajendra Banjade, Dan Stefanescu, William Baggett, and Brent Morgan. 2014. The DARE Corpus: A Resource for Anaphora Resolution in Dialogue Based Intelligent Tutoring Systems. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 3199–3203, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
The DARE Corpus: A Resource for Anaphora Resolution in Dialogue Based Intelligent Tutoring Systems (Niraula et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/372_Paper.pdf