Daan van Esch
2022
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Julia Kreutzer
|
Isaac Caswell
|
Lisa Wang
|
Ahsan Wahab
|
Daan van Esch
|
Nasanbayar Ulzii-Orshikh
|
Allahsera Tapo
|
Nishant Subramani
|
Artem Sokolov
|
Claytone Sikasote
|
Monang Setyawan
|
Supheakmungkol Sarin
|
Sokhar Samb
|
Benoît Sagot
|
Clara Rivera
|
Annette Rios
|
Isabel Papadimitriou
|
Salomey Osei
|
Pedro Ortiz Suarez
|
Iroro Orife
|
Kelechi Ogueji
|
Andre Niyongabo Rubungo
|
Toan Q. Nguyen
|
Mathias Müller
|
André Müller
|
Shamsuddeen Hassan Muhammad
|
Nanda Muhammad
|
Ayanda Mnyakeni
|
Jamshidbek Mirzakhalov
|
Tapiwanashe Matangira
|
Colin Leong
|
Nze Lawson
|
Sneha Kudugunta
|
Yacine Jernite
|
Mathias Jenny
|
Orhan Firat
|
Bonaventure F. P. Dossou
|
Sakhile Dlamini
|
Nisansa de Silva
|
Sakine Çabuk Ballı
|
Stella Biderman
|
Alessia Battisti
|
Ahmed Baruwa
|
Ankur Bapna
|
Pallavi Baljekar
|
Israel Abebe Azime
|
Ayodele Awokoya
|
Duygu Ataman
|
Orevaoghene Ahia
|
Oghenefego Ahia
|
Sweta Agrawal
|
Mofetoluwa Adeyemi
Transactions of the Association for Computational Linguistics, Volume 10
Writing System and Speaker Metadata for 2,800+ Language Varieties
Daan van Esch
|
Tamar Lucassen
|
Sebastian Ruder
|
Isaac Caswell
|
Clara Rivera
Proceedings of the Thirteenth Language Resources and Evaluation Conference
2021
How Might We Create Better Benchmarks for Speech Recognition?
Alëna Aksënova
|
Daan van Esch
|
James Flynn
|
Pavel Golik
Proceedings of the 1st Workshop on Benchmarking: Past, Present and Future
2020
Data-Driven Parametric Text Normalization: Rapidly Scaling Finite-State Transduction Verbalizers to New Languages
Sandy Ritchie
|
Eoin Mahon
|
Kim Heiligenstein
|
Nikos Bampounis
|
Daan van Esch
|
Christian Schallhart
|
Jonas Mortensen
|
Benoit Brard
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL)
Language ID in the Wild: Unexpected Challenges on the Path to a Thousand-Language Web Text Corpus
Isaac Caswell
|
Theresa Breiner
|
Daan van Esch
|
Ankur Bapna
Proceedings of the 28th International Conference on Computational Linguistics
2019
Future Directions in Technological Support for Language Documentation
Daan van Esch
|
Ben Foley
|
Nay San
Proceedings of the 3rd Workshop on the Use of Computational Methods in the Study of Endangered Languages Volume 1 (Papers)
2018
Text Normalization Infrastructure that Scales to Hundreds of Language Varieties
Mason Chua
|
Daan van Esch
|
Noah Coccaro
|
Eunjoon Cho
|
Sujeet Bhandari
|
Libin Jia
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Co-authors
- Isaac Caswell 3
- Clara Rivera 2
- Ankur Bapna 2
- Julia Kreutzer 1
- Lisa Wang 1
- show all...
- Ahsan Wahab 1
- Nasanbayar Ulzii-Orshikh 1
- Allahsera Tapo 1
- Nishant Subramani 1
- Artem Sokolov 1
- Claytone Sikasote 1
- Monang Setyawan 1
- Supheakmungkol Sarin 1
- Sokhar Samb 1
- Benoît Sagot 1
- Annette Rios Gonzales 1
- Isabel Papadimitriou 1
- Salomey Osei 1
- Pedro Ortiz Suarez 1
- Iroro Orife 1
- Kelechi Ogueji 1
- Andre Niyongabo Rubungo 1
- Toan Q. Nguyen 1
- Mathias Müller 1
- André Müller 1
- Shamsuddeen Hassan Muhammad 1
- Nanda Muhammad 1
- Ayanda Mnyakeni 1
- Jamshidbek Mirzakhalov 1
- Tapiwanashe Matangira 1
- Colin Leong 1
- Nze Lawson 1
- Sneha Kudugunta 1
- Yacine Jernite 1
- Mathias Jenny 1
- Orhan Firat 1
- Bonaventure F. P. Dossou 1
- Sakhile Dlamini 1
- Nisansa de Silva 1
- Sakine Çabuk Ballı 1
- Stella Biderman 1
- Alessia Battisti 1
- Ahmed Baruwa 1
- Pallavi Baljekar 1
- Israel Abebe Azime 1
- Ayodele Awokoya 1
- Duygu Ataman 1
- Orevaoghene Ahia 1
- Oghenefego Ahia 1
- Sweta Agrawal 1
- Mofetoluwa Adeyemi 1
- Ben Foley 1
- Nay San 1
- Tamar Lucassen 1
- Sebastian Ruder 1
- Sandy Ritchie 1
- Eoin Mahon 1
- Kim Heiligenstein 1
- Nikos Bampounis 1
- Christian Schallhart 1
- Jonas Mortensen 1
- Benoit Brard 1
- Mason Chua 1
- Noah Coccaro 1
- Eunjoon Cho 1
- Sujeet Bhandari 1
- Libin Jia 1
- Alëna Aksënova 1
- James Flynn 1
- Pavel Golik 1
- Theresa Breiner 1