Developing automatic verbatim transcripts for international multilingual meetings: an end-to-end solution

Akshat Dewan, Michal Ziemski, Henri Meylan, Lorenzo Concina, Bruno Pouliquen


Abstract
This paper presents an end-to-end solution for creating fully automated conference meeting transcripts and their machine translations into various languages. The tool was developed at the World Intellectual Property Organization (WIPO) using in-house speech-to-text (S2T) and machine translation (MT) components. Beyond describing the data collection and fine-tuning that resulted in a highly customized and robust system, this paper describes the architecture and evolution of the technical components and highlights the business impact and benefits from the user side. We also point out particular challenges in the evolution and adoption of the system and describe how the new approach created a new product and replaced established workflows in conference management documentation.
Anthology ID:
2023.mtsummit-users.18
Volume:
Proceedings of Machine Translation Summit XIX, Vol. 2: Users Track
Month:
September
Year:
2023
Address:
Macau SAR, China
Editors:
Masaru Yamada, Felix do Carmo
Venue:
MTSummit
Publisher:
Asia-Pacific Association for Machine Translation
Pages:
183–194
URL:
https://aclanthology.org/2023.mtsummit-users.18
Cite (ACL):
Akshat Dewan, Michal Ziemski, Henri Meylan, Lorenzo Concina, and Bruno Pouliquen. 2023. Developing automatic verbatim transcripts for international multilingual meetings: an end-to-end solution. In Proceedings of Machine Translation Summit XIX, Vol. 2: Users Track, pages 183–194, Macau SAR, China. Asia-Pacific Association for Machine Translation.
Cite (Informal):
Developing automatic verbatim transcripts for international multilingual meetings: an end-to-end solution (Dewan et al., MTSummit 2023)
PDF:
https://preview.aclanthology.org/nschneid-patch-5/2023.mtsummit-users.18.pdf