Deep Learning for Biomedical Information Retrieval: Learning Textual Relevance from Click Logs

Sunil Mohan, Nicolas Fiorini, Sun Kim, Zhiyong Lu


Abstract
We describe a Deep Learning approach to modeling the relevance of a document’s text to a query, applied to biomedical literature. Instead of mapping each document and query to a common semantic space, we compute a variable-length difference vector between the query and document which is then passed through a deep convolution stage followed by a deep regression network to produce the estimated probability of the document’s relevance to the query. Despite the small amount of training data, this approach produces a more robust predictor than computing similarities between semantic vector representations of the query and document, and also results in significant improvements over traditional IR text factors. In the future, we plan to explore its application in improving PubMed search.
Anthology ID:
W17-2328
Volume:
BioNLP 2017
Month:
August
Year:
2017
Address:
Vancouver, Canada,
Venue:
BioNLP
SIG:
SIGBIOMED
Publisher:
Association for Computational Linguistics
Note:
Pages:
222–231
Language:
URL:
https://aclanthology.org/W17-2328
DOI:
10.18653/v1/W17-2328
Bibkey:
Cite (ACL):
Sunil Mohan, Nicolas Fiorini, Sun Kim, and Zhiyong Lu. 2017. Deep Learning for Biomedical Information Retrieval: Learning Textual Relevance from Click Logs. In BioNLP 2017, pages 222–231, Vancouver, Canada,. Association for Computational Linguistics.
Cite (Informal):
Deep Learning for Biomedical Information Retrieval: Learning Textual Relevance from Click Logs (Mohan et al., BioNLP 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/W17-2328.pdf