Constructing a Question Corpus for Textual Semantic Relations

Rui Wang, Shuguang Li


Abstract
Finding useful questions is a challenging task in Community Question Answering (CQA). There are two key issues need to be resolved: 1) what is a useful question to the given reference question; and furthermore 2) what kind of relations exist between a given pair of questions. In order to answer these two questions, in this paper, we propose a fine-grained inventory of textual semantic relations between questions and annotate a corpus constructed from the WikiAnswers website. We also extract large archives of question pairs with user-generated links and use them as labeled data for separating useful questions from neutral ones, achieving 72.2% of accuracy. We find such online CQA repositories valuable resources for related research.
Anthology ID:
L12-1606
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
4092–4097
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/1017_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Rui Wang and Shuguang Li. 2012. Constructing a Question Corpus for Textual Semantic Relations. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 4092–4097, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Constructing a Question Corpus for Textual Semantic Relations (Wang & Li, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/1017_Paper.pdf