Investigating Domain-Specific Information for Neural Coreference Resolution on Biomedical Texts
Hai-Long Trieu, Nhung T. H. Nguyen, Makoto Miwa, Sophia Ananiadou
Abstract
Existing biomedical coreference resolution systems depend on features and/or rules based on syntactic parsers. In this paper, we investigate the utility of the state-of-the-art general domain neural coreference resolution system on biomedical texts. The system is an end-to-end system without depending on any syntactic parsers. We also investigate the domain specific features to enhance the system for biomedical texts. Experimental results on the BioNLP Protein Coreference dataset and the CRAFT corpus show that, with no parser information, the adapted system compared favorably with the systems that depend on parser information on these datasets, achieving 51.23% on the BioNLP dataset and 36.33% on the CRAFT corpus in F1 score. In-domain embeddings and domain-specific features helped improve the performance on the BioNLP dataset, but they did not on the CRAFT corpus.- Anthology ID:
- W18-2324
- Volume:
- Proceedings of the BioNLP 2018 workshop
- Month:
- July
- Year:
- 2018
- Address:
- Melbourne, Australia
- Venue:
- BioNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 183–188
- Language:
- URL:
- https://aclanthology.org/W18-2324
- DOI:
- 10.18653/v1/W18-2324
- Cite (ACL):
- Hai-Long Trieu, Nhung T. H. Nguyen, Makoto Miwa, and Sophia Ananiadou. 2018. Investigating Domain-Specific Information for Neural Coreference Resolution on Biomedical Texts. In Proceedings of the BioNLP 2018 workshop, pages 183–188, Melbourne, Australia. Association for Computational Linguistics.
- Cite (Informal):
- Investigating Domain-Specific Information for Neural Coreference Resolution on Biomedical Texts (Trieu et al., BioNLP 2018)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/W18-2324.pdf