Two Discourse Tree - Based Approaches to Indexing Answers

Boris Galitsky, Dmitry Ilvovsky


Abstract
We explore the anatomy of answers, asking which text fragments of an answer are worth matching against a question and which should not be matched. We apply Rhetorical Structure Theory to build a discourse tree of an answer and select the elementary discourse units that are suitable for indexing. Manual rules for selecting these discourse units, as well as automated classification based on web search engine mining, are evaluated with respect to how much they improve search accuracy. We form two sets of question-answer pairs, for the FAQ and community QA search domains, and use them to evaluate the proposed indexing methodology, which delivers up to a 16 percent improvement in search recall.
Anthology ID:
R19-1043
Volume:
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019)
Month:
September
Year:
2019
Address:
Varna, Bulgaria
Venue:
RANLP
Publisher:
INCOMA Ltd.
Pages:
367–372
URL:
https://aclanthology.org/R19-1043
DOI:
10.26615/978-954-452-056-4_043
Cite (ACL):
Boris Galitsky and Dmitry Ilvovsky. 2019. Two Discourse Tree - Based Approaches to Indexing Answers. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019), pages 367–372, Varna, Bulgaria. INCOMA Ltd.
Cite (Informal):
Two Discourse Tree - Based Approaches to Indexing Answers (Galitsky & Ilvovsky, RANLP 2019)
PDF:
https://preview.aclanthology.org/auto-file-uploads/R19-1043.pdf