Abstract
Recent approaches to data-to-text generation have adopted the very successful encoder-decoder architecture or variants thereof. These models generate text that is fluent (but often imprecise) and perform quite poorly at selecting appropriate content and ordering it coherently. To overcome some of these issues, we propose a neural model with a macro planning stage followed by a generation stage reminiscent of traditional methods which embrace separate modules for planning and surface realization. Macro plans represent high level organization of important content such as entities, events, and their interactions; they are learned from data and given as input to the generator. Extensive experiments on two data-to-text benchmarks (RotoWire and MLB) show that our approach outperforms competitive baselines in terms of automatic and human evaluation.- Anthology ID:
- 2021.tacl-1.31
- Volume:
- Transactions of the Association for Computational Linguistics, Volume 9
- Month:
- Year:
- 2021
- Address:
- Cambridge, MA
- Editors:
- Brian Roark, Ani Nenkova
- Venue:
- TACL
- SIG:
- Publisher:
- MIT Press
- Note:
- Pages:
- 510–527
- Language:
- URL:
- https://aclanthology.org/2021.tacl-1.31
- DOI:
- 10.1162/tacl_a_00381
- Cite (ACL):
- Ratish Puduppully and Mirella Lapata. 2021. Data-to-text Generation with Macro Planning. Transactions of the Association for Computational Linguistics, 9:510–527.
- Cite (Informal):
- Data-to-text Generation with Macro Planning (Puduppully & Lapata, TACL 2021)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2021.tacl-1.31.pdf