Abstract
We show that it is straightforward to train a state of the art named entity tagger (spaCy) to recognize political actors in Dutch parliamentary proceedings with high accuracy. The tagger was trained on 3.4K manually labeled examples, which were created in a modest 2.5 days work. This resource is made available on github. Besides proper nouns of persons and political parties, the tagger can recognize quite complex definite descriptions referring to cabinet ministers, ministries, and parliamentary committees. We also provide a demo search engine which employs the tagged entities in its SERP and result summaries.- Anthology ID:
- 2020.parlaclarin-1.7
- Volume:
- Proceedings of the Second ParlaCLARIN Workshop
- Month:
- May
- Year:
- 2020
- Address:
- Marseille, France
- Editors:
- Darja Fišer, Maria Eskevich, Franciska de Jong
- Venue:
- ParlaCLARIN
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 35–39
- Language:
- English
- URL:
- https://aclanthology.org/2020.parlaclarin-1.7
- DOI:
- Cite (ACL):
- Lennart Kerkvliet, Jaap Kamps, and Maarten Marx. 2020. Who mentions whom? Recognizing political actors in proceedings. In Proceedings of the Second ParlaCLARIN Workshop, pages 35–39, Marseille, France. European Language Resources Association.
- Cite (Informal):
- Who mentions whom? Recognizing political actors in proceedings (Kerkvliet et al., ParlaCLARIN 2020)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-1/2020.parlaclarin-1.7.pdf