Abstract
Biomedical event extraction is a crucial task in order to automatically extract information from the increasingly growing body of biomedical literature. Despite advances in the methods in recent years, most event extraction systems are still evaluated in-domain and on complete event structures only. This makes it hard to determine the performance of intermediate stages of the task, such as edge detection, across different corpora. Motivated by these limitations, we present the first cross-domain study of edge detection for biomedical event extraction. We analyze differences between five existing gold standard corpora, create a standardized benchmark corpus, and provide a strong baseline model for edge detection. Experiments show a large drop in performance when the baseline is applied on out-of-domain data, confirming the need for domain adaptation methods for the task. To encourage research efforts in this direction, we make both the data and the baseline available to the research community: https://www.cosbi.eu/cfx/9985.- Anthology ID:
- 2020.lrec-1.244
- Volume:
- Proceedings of the Twelfth Language Resources and Evaluation Conference
- Month:
- May
- Year:
- 2020
- Address:
- Marseille, France
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 1982–1989
- Language:
- English
- URL:
- https://aclanthology.org/2020.lrec-1.244
- DOI:
- Cite (ACL):
- Alan Ramponi, Barbara Plank, and Rosario Lombardo. 2020. Cross-Domain Evaluation of Edge Detection for Biomedical Event Extraction. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 1982–1989, Marseille, France. European Language Resources Association.
- Cite (Informal):
- Cross-Domain Evaluation of Edge Detection for Biomedical Event Extraction (Ramponi et al., LREC 2020)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/2020.lrec-1.244.pdf