Abstract
We propose a shared task on methodologies and algorithms for evaluating the accuracy of generated texts, specifically summaries of basketball games produced from basketball box score and other game data. We welcome submissions based on protocols for human evaluation, automatic metrics, as well as combinations of human evaluations and metrics.- Anthology ID:
- 2020.inlg-1.28
- Volume:
- Proceedings of the 13th International Conference on Natural Language Generation
- Month:
- December
- Year:
- 2020
- Address:
- Dublin, Ireland
- Editors:
- Brian Davis, Yvette Graham, John Kelleher, Yaji Sripada
- Venue:
- INLG
- SIG:
- SIGGEN
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 227–231
- Language:
- URL:
- https://aclanthology.org/2020.inlg-1.28
- DOI:
- 10.18653/v1/2020.inlg-1.28
- Cite (ACL):
- Ehud Reiter and Craig Thomson. 2020. Shared Task on Evaluating Accuracy. In Proceedings of the 13th International Conference on Natural Language Generation, pages 227–231, Dublin, Ireland. Association for Computational Linguistics.
- Cite (Informal):
- Shared Task on Evaluating Accuracy (Reiter & Thomson, INLG 2020)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/2020.inlg-1.28.pdf