Input Representations for Parsing Discourse Representation Structures: Comparing English with Chinese

Chunliu Wang, Rik van Noord, Arianna Bisazza, Johan Bos


Abstract
Neural semantic parsers have obtained acceptable results in the context of parsing DRSs (Discourse Representation Structures). In particular models with character sequences as input showed remarkable performance for English. But how does this approach perform on languages with a different writing system, like Chinese, a language with a large vocabulary of characters? Does rule-based tokenisation of the input help, and which granularity is preferred: characters, or words? The results are promising. Even with DRSs based on English, good results for Chinese are obtained. Tokenisation offers a small advantage for English, but not for Chinese. Overall, characters are preferred as input, both for English and Chinese.
Anthology ID:
2021.acl-short.97
Volume:
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)
Month:
August
Year:
2021
Address:
Online
Venues:
ACL | IJCNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
767–775
Language:
URL:
https://aclanthology.org/2021.acl-short.97
DOI:
10.18653/v1/2021.acl-short.97
Bibkey:
Cite (ACL):
Chunliu Wang, Rik van Noord, Arianna Bisazza, and Johan Bos. 2021. Input Representations for Parsing Discourse Representation Structures: Comparing English with Chinese. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pages 767–775, Online. Association for Computational Linguistics.
Cite (Informal):
Input Representations for Parsing Discourse Representation Structures: Comparing English with Chinese (Wang et al., ACL 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/update-css-js/2021.acl-short.97.pdf
Code
 wangchunliu/chinese-drs-parsing