Abstract
In this paper, we present the event detection models and systems we have developed for Multilingual Protest News Detection - Shared Task 1 at CASE 2021. The shared task has 4 subtasks which cover event detection at different granularity levels (from document level to token level) and across multiple languages (English, Hindi, Portuguese and Spanish). To handle data from multiple languages, we use a multilingual transformer-based language model (XLM-R) as the input text encoder. We apply a variety of techniques and build several transformer-based models that perform consistently well across all the subtasks and languages. Our systems achieve an average F_1 score of 81.2. Out of thirteen subtask-language tracks, our submissions rank 1st in nine and 2nd in four tracks.- Anthology ID:
- 2021.case-1.18
- Volume:
- Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021)
- Month:
- August
- Year:
- 2021
- Address:
- Online
- Editor:
- Ali Hürriyetoğlu
- Venue:
- CASE
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 138–146
- Language:
- URL:
- https://aclanthology.org/2021.case-1.18
- DOI:
- 10.18653/v1/2021.case-1.18
- Cite (ACL):
- Parul Awasthy, Jian Ni, Ken Barker, and Radu Florian. 2021. IBM MNLP IE at CASE 2021 Task 1: Multigranular and Multilingual Event Detection on Protest News. In Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021), pages 138–146, Online. Association for Computational Linguistics.
- Cite (Informal):
- IBM MNLP IE at CASE 2021 Task 1: Multigranular and Multilingual Event Detection on Protest News (Awasthy et al., CASE 2021)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2021.case-1.18.pdf