Abstract
Neural semantic parsers have obtained acceptable results in the context of parsing DRSs (Discourse Representation Structures). In particular models with character sequences as input showed remarkable performance for English. But how does this approach perform on languages with a different writing system, like Chinese, a language with a large vocabulary of characters? Does rule-based tokenisation of the input help, and which granularity is preferred: characters, or words? The results are promising. Even with DRSs based on English, good results for Chinese are obtained. Tokenisation offers a small advantage for English, but not for Chinese. Overall, characters are preferred as input, both for English and Chinese.- Anthology ID:
- 2021.acl-short.97
- Volume:
- Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)
- Month:
- August
- Year:
- 2021
- Address:
- Online
- Editors:
- Chengqing Zong, Fei Xia, Wenjie Li, Roberto Navigli
- Venues:
- ACL | IJCNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 767–775
- Language:
- URL:
- https://preview.aclanthology.org/icon-24-ingestion/2021.acl-short.97/
- DOI:
- 10.18653/v1/2021.acl-short.97
- Cite (ACL):
- Chunliu Wang, Rik van Noord, Arianna Bisazza, and Johan Bos. 2021. Input Representations for Parsing Discourse Representation Structures: Comparing English with Chinese. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pages 767–775, Online. Association for Computational Linguistics.
- Cite (Informal):
- Input Representations for Parsing Discourse Representation Structures: Comparing English with Chinese (Wang et al., ACL-IJCNLP 2021)
- PDF:
- https://preview.aclanthology.org/icon-24-ingestion/2021.acl-short.97.pdf
- Code
- wangchunliu/chinese-drs-parsing