Template-free Data-to-Text Generation of Finnish Sports News

Jenna Kanerva, Samuel Rönnqvist, Riina Kekki, Tapio Salakoski, Filip Ginter

[How to correct problems with metadata yourself]


Abstract
News articles such as sports game reports are often thought to closely follow the underlying game statistics, but in practice they contain a notable amount of background knowledge, interpretation, insight into the game, and quotes that are not present in the official statistics. This poses a challenge for automated data-to-text news generation with real-world news corpora as training data. We report on the development of a corpus of Finnish ice hockey news, edited to be suitable for training of end-to-end news generation methods, as well as demonstrate generation of text, which was judged by journalists to be relatively close to a viable product. The new dataset and system source code are available for research purposes.
Anthology ID:
W19-6125
Volume:
Proceedings of the 22nd Nordic Conference on Computational Linguistics
Month:
September–October
Year:
2019
Address:
Turku, Finland
Editors:
Mareike Hartmann, Barbara Plank
Venue:
NoDaLiDa
SIG:
Publisher:
Linköping University Electronic Press
Note:
Pages:
242–252
Language:
URL:
https://aclanthology.org/W19-6125
DOI:
Bibkey:
Cite (ACL):
Jenna Kanerva, Samuel Rönnqvist, Riina Kekki, Tapio Salakoski, and Filip Ginter. 2019. Template-free Data-to-Text Generation of Finnish Sports News. In Proceedings of the 22nd Nordic Conference on Computational Linguistics, pages 242–252, Turku, Finland. Linköping University Electronic Press.
Cite (Informal):
Template-free Data-to-Text Generation of Finnish Sports News (Kanerva et al., NoDaLiDa 2019)
Copy Citation:
PDF:
https://preview.aclanthology.org/teach-a-man-to-fish/W19-6125.pdf
Code
 scoopmatic/finnish-hockey-news-generation-paper
Data
Ice Hockey News DatasetRotoWire