Abstract
Event extraction (EE), as a crucial information extraction (IE) task, aims to identify event triggers and their associated arguments from unstructured text, subsequently classifying them into pre-defined types and roles. In the biomedical domain, EE is widely used to extract complex structures representing biological events from literature. Due to the complicated semantics and specialized domain knowledge, it is challenging to construct biomedical event extraction datasets. Additionally, most existing biomedical EE datasets primarily focus on cell experiments or the overall experimental procedures. Therefore, we introduce AniEE, an event extraction dataset concentrated on the animal experiment stage. We establish a novel animal experiment customized entity and event scheme in collaboration with domain experts. We then create an expert-annotated high-quality dataset containing discontinuous entities and nested events and evaluate our dataset on the recent outstanding NER and EE models.- Anthology ID:
- 2023.findings-emnlp.863
- Volume:
- Findings of the Association for Computational Linguistics: EMNLP 2023
- Month:
- December
- Year:
- 2023
- Address:
- Singapore
- Editors:
- Houda Bouamor, Juan Pino, Kalika Bali
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 12959–12971
- Language:
- URL:
- https://aclanthology.org/2023.findings-emnlp.863
- DOI:
- 10.18653/v1/2023.findings-emnlp.863
- Cite (ACL):
- Dohee Kim, Ra Yoo, Soyoung Yang, Hee Yang, and Jaegul Choo. 2023. AniEE: A Dataset of Animal Experimental Literature for Event Extraction. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 12959–12971, Singapore. Association for Computational Linguistics.
- Cite (Informal):
- AniEE: A Dataset of Animal Experimental Literature for Event Extraction (Kim et al., Findings 2023)
- PDF:
- https://preview.aclanthology.org/ingest-2024-clasp/2023.findings-emnlp.863.pdf