Abstract
It is reported that grammatical information is useful for machine translation (MT) task. However, the annotation of grammatical information requires the highly human resources. Furthermore, it is not trivial to adapt grammatical information to MT since grammatical annotation usually adapts tokenization standards which might not be suitable to capture the relation of two languages, and the use of sub-word tokenization, e.g., Byte-Pair-Encoding, to alleviate out-of-vocabulary problem might not be compatible with those annotations. In this work, we propose two methods to explicitly incorporate grammatical information without supervising annotation; first, latent phrase structure is induced in an unsupervised fashion from a multi-head attention mechanism; second, the induced phrase structures in encoder and decoder are synchronized so that they are compatible with each other using constraints during training. We demonstrate that our approach produces better performance and explainability in two tasks, translation and alignment tasks without extra resources. Although we could not obtain the high quality phrase structure in constituency parsing when evaluated monolingually, we find that the induced phrase structures enhance the explainability of translation through the synchronization constraint.- Anthology ID:
- 2021.acl-srw.33
- Volume:
- Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: Student Research Workshop
- Month:
- August
- Year:
- 2021
- Address:
- Online
- Editors:
- Jad Kabbara, Haitao Lin, Amandalynne Paullada, Jannis Vamvas
- Venues:
- ACL | IJCNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 321–330
- Language:
- URL:
- https://aclanthology.org/2021.acl-srw.33
- DOI:
- 10.18653/v1/2021.acl-srw.33
- Cite (ACL):
- Shintaro Harada and Taro Watanabe. 2021. Neural Machine Translation with Synchronous Latent Phrase Structure. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: Student Research Workshop, pages 321–330, Online. Association for Computational Linguistics.
- Cite (Informal):
- Neural Machine Translation with Synchronous Latent Phrase Structure (Harada & Watanabe, ACL-IJCNLP 2021)
- PDF:
- https://preview.aclanthology.org/emnlp22-frontmatter/2021.acl-srw.33.pdf
- Data
- ASPEC