Abstract
In this paper, we describe Alibaba’s participating system in the semEval-2018 Task5: Counting Events and Participants in the Long Tail. We designed and implemented a pipeline system that consists of components to extract question properties and document features, document event category classifications, document retrieval and document clustering. To retrieve the majority of the relevant documents, we carefully designed our system to extract key information from each question and document pair. After retrieval, we perform further document clustering to count the number of events. The task contains 3 subtasks, on which we achieved F1 score of 78.33, 50.52, 63.59 , respectively, for document level retrieval. Our system ranks first in all the three subtasks on document level retrieval, and it also ranks first in incident-level evaluation by RSME measure in subtask 3.- Anthology ID:
- S18-1110
- Volume:
- Proceedings of the 12th International Workshop on Semantic Evaluation
- Month:
- June
- Year:
- 2018
- Address:
- New Orleans, Louisiana
- Venues:
- SemEval | *SEM
- SIGs:
- SIGLEX | SIGSEM
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 674–678
- Language:
- URL:
- https://aclanthology.org/S18-1110
- DOI:
- 10.18653/v1/S18-1110
- Cite (ACL):
- Yingchi Liu, Quanzhi Li, and Luo Si. 2018. NAI-SEA at SemEval-2018 Task 5: An Event Search System. In Proceedings of the 12th International Workshop on Semantic Evaluation, pages 674–678, New Orleans, Louisiana. Association for Computational Linguistics.
- Cite (Informal):
- NAI-SEA at SemEval-2018 Task 5: An Event Search System (Liu et al., SemEval-*SEM 2018)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/S18-1110.pdf