Thomas Mayer
2014
Creating a massively parallel Bible corpus
Thomas Mayer | Michael Cysouw
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Thomas Mayer | Michael Cysouw
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
We present our ongoing effort to create a massively parallel Bible corpus. While an ever-increasing number of Bible translations is available in electronic form on the internet, there is no large-scale parallel Bible corpus that allows language researchers to easily get access to the texts and their parallel structure for a large variety of different languages. We report on the current status of the corpus, with over 900 translations in more than 830 language varieties. All translations are tokenized (e.g., separating punctuation marks) and Unicode normalized. Mainly due to copyright restrictions only portions of the texts are made publicly available. However, we provide co-occurrence information for each translation in a (sparse) matrix format. All word forms in the translation are given together with their frequency and the verses in which they occur.
2013
PhonMatrix: Visualizing co-occurrence constraints of sounds
Thomas Mayer | Christian Rohrdantz
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations
Thomas Mayer | Christian Rohrdantz
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations
2012
Introduction
Miriam Butt | Jelena Prokić | Thomas Mayer | Michael Cysouw
Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH
Miriam Butt | Jelena Prokić | Thomas Mayer | Michael Cysouw
Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH
Language comparison through sparse multilingual word alignment
Thomas Mayer | Michael Cysouw
Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH
Thomas Mayer | Michael Cysouw
Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH
2011
Towards Tracking Semantic Change by Visual Analytics
Christian Rohrdantz | Annette Hautli | Thomas Mayer | Miriam Butt | Daniel A. Keim | Frans Plank
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
Christian Rohrdantz | Annette Hautli | Thomas Mayer | Miriam Butt | Daniel A. Keim | Frans Plank
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies