Abstract
We explore anatomy of answers with respect to which text fragments from an answer are worth matching with a question and which should not be matched. We apply the Rhetorical Structure Theory to build a discourse tree of an answer and select elementary discourse units that are suitable for indexing. Manual rules for selection of these discourse units as well as automated classification based on web search engine mining are evaluated con-cerning improving search accuracy. We form two sets of question-answer pairs for FAQ and community QA search domains and use them for evaluation of the proposed indexing methodology, which delivers up to 16 percent improvement in search recall.- Anthology ID:
- R19-1043
- Volume:
- Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019)
- Month:
- September
- Year:
- 2019
- Address:
- Varna, Bulgaria
- Editors:
- Ruslan Mitkov, Galia Angelova
- Venue:
- RANLP
- SIG:
- Publisher:
- INCOMA Ltd.
- Note:
- Pages:
- 367–372
- Language:
- URL:
- https://preview.aclanthology.org/build-pipeline-with-new-library/R19-1043/
- DOI:
- 10.26615/978-954-452-056-4_043
- Cite (ACL):
- Boris Galitsky and Dmitry Ilvovsky. 2019. Two Discourse Tree - Based Approaches to Indexing Answers. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019), pages 367–372, Varna, Bulgaria. INCOMA Ltd..
- Cite (Informal):
- Two Discourse Tree - Based Approaches to Indexing Answers (Galitsky & Ilvovsky, RANLP 2019)
- PDF:
- https://preview.aclanthology.org/build-pipeline-with-new-library/R19-1043.pdf