Abstract
Simultaneous machine translation has recently gained traction thanks to significant quality improvements and the advent of streaming applications. Simultaneous translation systems need to find a trade-off between translation quality and response time, and with this purpose multiple latency measures have been proposed. However, latency evaluations for simultaneous translation are estimated at the sentence level, not taking into account the sequential nature of a streaming scenario. Indeed, these sentence-level latency measures are not well suited for continuous stream translation, resulting in figures that are not coherent with the simultaneous translation policy of the system being assessed. This work proposes a stream-level adaptation of the current latency measures based on a re-segmentation approach applied to the output translation, that is successfully evaluated on streaming conditions for a reference IWSLT task- Anthology ID:
- 2022.acl-long.480
- Volume:
- Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
- Month:
- May
- Year:
- 2022
- Address:
- Dublin, Ireland
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 6972–6985
- Language:
- URL:
- https://aclanthology.org/2022.acl-long.480
- DOI:
- 10.18653/v1/2022.acl-long.480
- Cite (ACL):
- Javier Iranzo Sanchez, Jorge Civera, and Alfons Juan-Císcar. 2022. From Simultaneous to Streaming Machine Translation by Leveraging Streaming History. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 6972–6985, Dublin, Ireland. Association for Computational Linguistics.
- Cite (Informal):
- From Simultaneous to Streaming Machine Translation by Leveraging Streaming History (Iranzo Sanchez et al., ACL 2022)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/2022.acl-long.480.pdf
- Data
- MuST-C