Abstract
Argumentative corpora are costly to create and are available in only few languages with English dominating the area. In this paper we release the first publicly available Mandarin argumentative corpus. The corpus is created by exploiting the idea of comparable corpora from Statistical Machine Translation. We use existing corpora in English and manually map the claims and premises to comparable corpora in Mandarin. We also implement a simple solution to automate this approach with the view of creating argumentative corpora in other less-resourced languages. In this way we introduce a new task of multi-lingual argument mapping that can be evaluated using our English-Mandarin argumentative corpus. The preliminary results of our automatic argument mapper mirror the simplicity of our approach, but provide a baseline for further improvements.- Anthology ID:
- W17-5108
- Volume:
- Proceedings of the 4th Workshop on Argument Mining
- Month:
- September
- Year:
- 2017
- Address:
- Copenhagen, Denmark
- Venue:
- ArgMining
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 67–72
- Language:
- URL:
- https://aclanthology.org/W17-5108
- DOI:
- 10.18653/v1/W17-5108
- Cite (ACL):
- Ahmet Aker and Huangpan Zhang. 2017. Projection of Argumentative Corpora from Source to Target Languages. In Proceedings of the 4th Workshop on Argument Mining, pages 67–72, Copenhagen, Denmark. Association for Computational Linguistics.
- Cite (Informal):
- Projection of Argumentative Corpora from Source to Target Languages (Aker & Zhang, ArgMining 2017)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/W17-5108.pdf