Věra Kloudová
Also published as: Vĕra Kloudová
2026
Challenges in Machine Translation of Interactive Multimodal Exercises
Lucie Polakova | Miroslav Hrabal | Věra Kloudová | Michal Novák | Mariia Anisimova | Martin Popel
Proceedings of the 21st Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2026)
This paper describes linguistic and technological challenges encountered within an applied project aimed at expanding a large e-learning portal from its original Czech to three other languages: Ukrainian, English and German. Although there seems to be a general belief that machine translation is a solved task in 2026, we show that translating educational content, which in our case is highly terminological, multimodal, interactive and encoded in XML, brings many challenges of different types, some easily solvable and some not. We also compare our results from the early phase of the project (Transformer-based machine translation) with those obtained after the switch to LLM-based translation methods. We show that the two MT methods are prone to different types of errors, some of which are quite new (such as the undesired correction of counterfactual statements) and require new ways of handling them. The resulting four-language edition of the educational web portal will be freely available to educators, students and researchers by the end of 2026.
2022
Findings of the IWSLT 2022 Evaluation Campaign
Antonios Anastasopoulos | Loïc Barrault | Luisa Bentivogli | Marcely Zanon Boito | Ondřej Bojar | Roldano Cattoni | Anna Currey | Georgiana Dinu | Kevin Duh | Maha Elbayad | Clara Emmanuel | Yannick Estève | Marcello Federico | Christian Federmann | Souhir Gahbiche | Hongyu Gong | Roman Grundkiewicz | Barry Haddow | Benjamin Hsu | Dávid Javorský | Vĕra Kloudová | Surafel Lakew | Xutai Ma | Prashant Mathur | Paul McNamee | Kenton Murray | Maria Nǎdejde | Satoshi Nakamura | Matteo Negri | Jan Niehues | Xing Niu | John Ortega | Juan Pino | Elizabeth Salesky | Jiatong Shi | Matthias Sperber | Sebastian Stüker | Katsuhito Sudoh | Marco Turchi | Yogesh Virkar | Alexander Waibel | Changhan Wang | Shinji Watanabe
Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022)
The evaluation campaign of the 19th International Conference on Spoken Language Translation featured eight shared tasks: (i) Simultaneous speech translation, (ii) Offline speech translation, (iii) Speech to speech translation, (iv) Low-resource speech translation, (v) Multilingual speech translation, (vi) Dialect speech translation, (vii) Formality control for speech translation, (viii) Isometric speech translation. A total of 27 teams participated in at least one of the shared tasks. This paper details, for each shared task, the purpose of the task, the data that were released, the evaluation metrics that were applied, the submissions that were received and the results that were achieved.
2021
Detecting Post-Edited References and Their Effect on Human Evaluation
Věra Kloudová | Ondřej Bojar | Martin Popel
Proceedings of the Workshop on Human Evaluation of NLP Systems (HumEval)
This paper provides a quick overview of methods for detecting whether reference translations were actually created by post-editing an MT system. Two methods based on automatic metrics are presented: the BLEU difference between the suspected MT and some other good MT, and the BLEU difference using additional references. These two methods raised the suspicion that the WMT 2020 Czech reference is based on MT. The suspicion was confirmed in a manual analysis by finding concrete evidence of post-editing in particular sentences. Finally, a typology of post-editing changes is presented, classifying typical errors or changes made by the post-editor and errors adopted from the MT.
Co-authors
- Ondřej Bojar 2
- Martin Popel 2
- Antonios Anastasopoulos 1
- Mariia Anisimova 1
- Loic Barrault 1
- Luisa Bentivogli 1
- Roldano Cattoni 1
- Anna Currey 1
- Georgiana Dinu 1
- Kevin Duh 1
- Maha Elbayad 1
- Clara Emmanuel 1
- Yannick Estève 1
- Marcello Federico 1
- Christian Federmann 1
- Souhir Gahbiche 1
- Hongyu Gong 1
- Roman Grundkiewicz 1
- Barry Haddow 1
- Miroslav Hrabal 1
- Benjamin Hsu 1
- Dávid Javorský 1
- Surafel Lakew 1
- Xutai Ma 1
- Prashant Mathur 1
- Paul McNamee 1
- Kenton Murray 1
- Maria Nadejde 1
- Satoshi Nakamura 1
- Matteo Negri 1
- Jan Niehues 1
- Xing Niu 1
- Michal Novák 1
- John Ortega 1
- Juan Pino 1
- Lucie Polakova 1
- Elizabeth Salesky 1
- Jiatong Shi 1
- Matthias Sperber 1
- Sebastian Stüker 1
- Katsuhito Sudoh 1
- Marco Turchi 1
- Yogesh Virkar 1
- Alex Waibel 1
- Changhan Wang 1
- Shinji Watanabe 1
- Marcely Zanon Boito 1