RAST: Domain-Robust Dialogue Rewriting as Sequence Tagging
Jie Hao, Linfeng Song, Liwei Wang, Kun Xu, Zhaopeng Tu, Dong Yu
Abstract
The task of dialogue rewriting aims to reconstruct the latest dialogue utterance by copying the missing content from the dialogue context. Until now, the existing models for this task suffer from the robustness issue, i.e., performances drop dramatically when testing on a different dataset. We address this robustness issue by proposing a novel sequence-tagging-based model so that the search space is significantly reduced, yet the core of this task is still well covered. As a common issue of most tagging models for text generation, the model’s outputs may lack fluency. To alleviate this issue, we inject the loss signal from BLEU or GPT-2 under a REINFORCE framework. Experiments show huge improvements of our model over the current state-of-the-art systems when transferring to another dataset.- Anthology ID:
- 2021.emnlp-main.402
- Volume:
- Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
- Month:
- November
- Year:
- 2021
- Address:
- Online and Punta Cana, Dominican Republic
- Editors:
- Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
- Venue:
- EMNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 4913–4924
- Language:
- URL:
- https://preview.aclanthology.org/icon-24-ingestion/2021.emnlp-main.402/
- DOI:
- 10.18653/v1/2021.emnlp-main.402
- Cite (ACL):
- Jie Hao, Linfeng Song, Liwei Wang, Kun Xu, Zhaopeng Tu, and Dong Yu. 2021. RAST: Domain-Robust Dialogue Rewriting as Sequence Tagging. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 4913–4924, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
- Cite (Informal):
- RAST: Domain-Robust Dialogue Rewriting as Sequence Tagging (Hao et al., EMNLP 2021)
- PDF:
- https://preview.aclanthology.org/icon-24-ingestion/2021.emnlp-main.402.pdf