Learning Open Domain Multi-hop Search Using Reinforcement Learning

Enrique Noriega-Atala; Mihai Surdeanu; Clayton Morrison

doi:10.18653/v1/2022.suki-1.4

Learning Open Domain Multi-hop Search Using Reinforcement Learning

Enrique Noriega-Atala, Mihai Surdeanu, Clayton Morrison

Abstract

We propose a method to teach an automated agent to learn how to search for multi-hop paths of relations between entities in an open domain. The method learns a policy for directing existing information retrieval and machine reading resources to focus on relevant regions of a corpus. The approach formulates the learning problem as a Markov decision process with a state representation that encodes the dynamics of the search process and a reward structure that minimizes the number of documents that must be processed while still finding multi-hop paths. We implement the method in an actor-critic reinforcement learning algorithm and evaluate it on a dataset of search problems derived from a subset of English Wikipedia. The algorithm finds a family of policies that succeeds in extracting the desired information while processing fewer documents compared to several baseline heuristic algorithms.

Anthology ID:: 2022.suki-1.4
Volume:: Proceedings of the Workshop on Structured and Unstructured Knowledge Integration (SUKI)
Month:: July
Year:: 2022
Address:: Seattle, USA
Editors:: Wenhu Chen, Xinyun Chen, Zhiyu Chen, Ziyu Yao, Michihiro Yasunaga, Tao Yu, Rui Zhang
Venue:: SUKI
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 26–35
Language:
URL:: https://preview.aclanthology.org/add-emnlp-2024-awards/2022.suki-1.4/
DOI:: 10.18653/v1/2022.suki-1.4
Bibkey:
Cite (ACL):: Enrique Noriega-Atala, Mihai Surdeanu, and Clayton Morrison. 2022. Learning Open Domain Multi-hop Search Using Reinforcement Learning. In Proceedings of the Workshop on Structured and Unstructured Knowledge Integration (SUKI), pages 26–35, Seattle, USA. Association for Computational Linguistics.
Cite (Informal):: Learning Open Domain Multi-hop Search Using Reinforcement Learning (Noriega-Atala et al., SUKI 2022)
Copy Citation:
PDF:: https://preview.aclanthology.org/add-emnlp-2024-awards/2022.suki-1.4.pdf
Video:: https://preview.aclanthology.org/add-emnlp-2024-awards/2022.suki-1.4.mp4
Data: WikiHop

PDF Cite Search Video Fix data