Chandresh Kumar Maurya
Also published as: Chandresh Kumar Maurya
2026
Speech Translation and Metrics in 2026: Findings of the IWSLT Campaign
David Ifeoluwa Adelani | Victor Agostinelli | Antonios Anastasopoulos | Luisa Bentivogli | Ondřej Bojar | Sébastien Bratières | Marine Carpuat | Fabrício Carraro | Roldano Cattoni | Mauro Cettolo | Lizhong Chen | Marcello Federico | Marco Gaido | Mahendra Gupta | HyoJung Han | Ali Hatami | Lewis C. Howe | Dávid Javorský | Yejin Jeon | Marek Kasztelnik | Antoine Laurent | Danni Liu | Nam Luu | Min Ma | Dominik Macháček | Marie Maltais | Evgeny Matusov | John McCrae | Chutong Meng | Chandresh Kumar Maurya | Mohammad Mohammadamini | Yasmin Moslem | Kenton Murray | Satoshi Nakamura | Matteo Negri | Jan Niehues | Atul Kr. Ojha | John E. Ortega | Siqi Ouyang | Sara Papi | Peter Polák | Fabian Retkowski | Stephanny Sánchez | Beatrice Savoldi | Claytone Sikasote | Matthias Sperber | Sebastian Stüker | Katsuhito Sudoh | Marie Tahon | Marco Turchi | Alexander Waibel | Patrick Wilken | Rodolfo Joel Zevallos | Vilem Zouhar | Maike Züfle
Proceedings of the 23rd International Conference on Spoken Language Translation (IWSLT 2026)
This paper reports on the outcomes of the shared tasks organized as part of the 23rd International Workshop on Spoken Language Translation (IWSLT). The workshop covered ten major challenges in spoken language translation, including speech-to-text translation for both high-resource and low-resource language pairs, customized speech translation, speech generation, instruction-following speech processing, and the evaluation of speech translation systems. The shared tasks received strong participation, with more than 30 teams submitting runs. This year’s edition broadened the range of tasks, placing particular emphasis on speech generation and evaluation metrics.
CLAOCS-TX: Cross-Lingual Triplet Extraction with Aspect-Opinion-Aware Code-Switched Prompting and LLM-Guided Contrastive Distillation
Lipika Dewangan | Chandresh Kumar Maurya
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Cross-lingual learning enables the transfer of structured sentiment knowledge from high-resource languages to unlabeled or low-resource languages, but prior work has largely focused on coarse-grained sentiment classification or aspect extraction. In contrast, zero-shot cross-lingual aspect–opinion–sentiment triplet extraction (ASTE), which extracts sentiment triplets of the form (aspect term, opinion term, sentiment polarity), remains underexplored. We propose a unified framework that leverages large language models (LLMs) as both structured pseudo-label generators and semantic teachers for ASTE. Our approach employs stepwise structured prompting over aspect- and opinion-aware code-switched variants to generate reliable pseudo triplets, followed by a multi-variant consistency filter to retain high-confidence supervision. We further introduce a triplet-aware contrastive distillation objective that aligns student triplet representations with LLM-encoded semantic embeddings. During inference, only the student ASTE model is used, without requiring LLM access. Experiments on four non-Indic and four low-resource Indic target languages show consistent improvements over strong cross-lingual and LLM-based baselines. The proposed method yields an absolute micro-F1 improvement of 5.3 points on non-Indic languages and 3.8 points on low-resource Indic languages compared to the best competing approach. Ablation results further validate the complementary roles of aspect- and opinion-aware code-switched prompting and triplet-aware contrastive distillation, with larger relative gains observed in low-resource Indic settings.
2025
Findings of the IWSLT 2025 Evaluation Campaign
Idris Abdulmumin | Victor Agostinelli | Tanel Alumäe | Antonios Anastasopoulos | Luisa Bentivogli | Ondřej Bojar | Claudia Borg | Fethi Bougares | Roldano Cattoni | Mauro Cettolo | Lizhong Chen | William Chen | Raj Dabre | Yannick Estève | Marcello Federico | Mark Fishel | Marco Gaido | Dávid Javorský | Marek Kasztelnik | Fortuné Kponou | Mateusz Krubiński | Tsz Kin Lam | Danni Liu | Evgeny Matusov | Chandresh Kumar Maurya | John P. McCrae | Salima Mdhaffar | Yasmin Moslem | Kenton Murray | Satoshi Nakamura | Matteo Negri | Jan Niehues | Atul Kr. Ojha | John E. Ortega | Sara Papi | Pavel Pecina | Peter Polák | Piotr Połeć | Ashwin Sankar | Beatrice Savoldi | Nivedita Sethiya | Claytone Sikasote | Matthias Sperber | Sebastian Stüker | Katsuhito Sudoh | Brian Thompson | Marco Turchi | Alex Waibel | Patrick Wilken | Rodolfo Zevallos | Vilém Zouhar | Maike Züfle
Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025)
This paper presents the outcomes of the shared tasks conducted at the 22nd International Workshop on Spoken Language Translation (IWSLT). The workshop addressed seven critical challenges in spoken language translation: simultaneous and offline translation, automatic subtitling and dubbing, model compression, speech-to-speech translation, dialect and low-resource speech translation, and Indic languages. The shared tasks garnered significant participation, with 32 teams submitting their runs. The field’s growing importance is reflected in the increasing diversity of shared task organizers and contributors to this overview paper, representing a balanced mix of industrial and academic institutions. This broad participation demonstrates the rising prominence of spoken language translation in both research and practical applications.
Indic-S2ST: a Multilingual and Multimodal Many-to-Many Indic Speech-to-Speech Translation Dataset
Nivedita Sethiya | Puneet Walia | Chandresh Kumar Maurya
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics
Speech-to-Speech Translation (S2ST) converts speech from one language to speech in a different language. While various S2ST models exist, none adequately support Indic languages, primarily due to the lack of a suitable dataset. We fill this gap by introducing Indic-S2ST, a multilingual and multimodal many-to-many S2ST dataset of approximately 600 hours in 14 Indic languages, including Indian-accented English. To the best of our knowledge, this is the largest dataset for the S2ST task with parallel speech and text in 14 scheduled Indic languages. Our data also supports Automatic Speech Recognition (ASR), Text-to-Speech (TTS) synthesis, Speech-to-Text translation (ST), and Machine Translation (MT) due to parallel speech and text alignment. Thus, our data may be useful to train a model like Meta’s SeamlessM4T for Indic languages. We also propose Indic-S2UT, a discrete unit-based S2ST model for Indic languages. To showcase the utility of the data, we present baseline results on the Indic-S2ST data using Indic-S2UT. The dataset and code are available at https://github.com/Nivedita5/Indic-S2ST/blob/main/README.md.
Co-authors
- Victor Agostinelli 2
- Antonios Anastasopoulos 2
- Luisa Bentivogli 2
- Ondřej Bojar 2
- Roldano Cattoni 2
- Mauro Cettolo 2
- Lizhong Chen 2
- Marcello Federico 2
- Marco Gaido 2
- Dávid Javorský 2
- Marek Kasztelnik 2
- Danni Liu 2
- Evgeny Matusov 2
- John Philip McCrae 2
- Yasmin Moslem 2
- Kenton Murray 2
- Satoshi Nakamura 2
- Matteo Negri 2
- Jan Niehues 2
- Atul Kr. Ojha 2
- John E. Ortega 2
- Sara Papi 2
- Peter Polák 2
- Beatrice Savoldi 2
- Nivedita Sethiya 2
- Claytone Sikasote 2
- Matthias Sperber 2
- Katsuhito Sudoh 2
- Marco Turchi 2
- Patrick Wilken 2
- Rodolfo Zevallos 2
- Vilém Zouhar 2
- Maike Züfle 2
- Idris Abdulmumin 1
- David Ifeoluwa Adelani 1
- Tanel Alumäe 1
- Claudia Borg 1
- Fethi Bougares 1
- Sébastien Bratières 1
- Marine Carpuat 1
- Fabrício Carraro 1
- William Chen 1
- Raj Dabre 1
- Lipika Dewangan 1
- Yannick Estève 1
- Mark Fishel 1
- Mahendra Gupta 1
- HyoJung Han 1
- Ali Hatami 1
- Lewis C. Howe 1
- Yejin Jeon 1
- Fortuné Kponou 1
- Mateusz Krubiński 1
- Tsz Kin Lam 1
- Antoine Laurent 1
- Nam Luu 1
- Min Ma 1
- Dominik Macháček 1
- Marie Maltais 1
- Salima Mdhaffar 1
- Chutong Meng 1
- Mohammad Mohammadamini 1
- Siqi Ouyang 1
- Pavel Pecina 1
- Piotr Połeć 1
- Fabian Retkowski 1
- Ashwin Sankar 1
- Sebastian Stüker 1
- Stephanny Sánchez 1
- Marie Tahon 1
- Brian Thompson 1
- Alex Waibel 1
- Alexander Waibel 1
- Puneet Walia 1