From Simultaneous to Streaming Machine Translation by Leveraging Streaming History

Javier Iranzo Sanchez, Jorge Civera, Alfons Juan-Císcar


Abstract
Simultaneous machine translation has recently gained traction thanks to significant quality improvements and the advent of streaming applications. Simultaneous translation systems need to find a trade-off between translation quality and response time, and with this purpose multiple latency measures have been proposed. However, latency evaluations for simultaneous translation are estimated at the sentence level, not taking into account the sequential nature of a streaming scenario. Indeed, these sentence-level latency measures are not well suited for continuous stream translation, resulting in figures that are not coherent with the simultaneous translation policy of the system being assessed. This work proposes a stream-level adaptation of the current latency measures based on a re-segmentation approach applied to the output translation, that is successfully evaluated on streaming conditions for a reference IWSLT task
Anthology ID:
2022.acl-long.480
Volume:
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
May
Year:
2022
Address:
Dublin, Ireland
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
6972–6985
Language:
URL:
https://aclanthology.org/2022.acl-long.480
DOI:
10.18653/v1/2022.acl-long.480
Bibkey:
Cite (ACL):
Javier Iranzo Sanchez, Jorge Civera, and Alfons Juan-Císcar. 2022. From Simultaneous to Streaming Machine Translation by Leveraging Streaming History. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 6972–6985, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
From Simultaneous to Streaming Machine Translation by Leveraging Streaming History (Iranzo Sanchez et al., ACL 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/2022.acl-long.480.pdf
Software:
 2022.acl-long.480.software.zip
Video:
 https://preview.aclanthology.org/auto-file-uploads/2022.acl-long.480.mp4
Data
MuST-C