Christoph Tillmann
Also published as: C. Tillmann
2023
Muted: Multilingual Targeted Offensive Speech Identification and Visualization
Christoph Tillmann | Aashka Trivedi | Sara Rosenthal | Santosh Borse | Rong Zhang | Avirup Sil | Bishwaranjan Bhattacharjee
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
Offensive language such as hate, abuse, and profanity (HAP) occurs in various content on the web. While previous work has mostly dealt with sentence-level annotations, there have been a few recent attempts to identify offensive spans as well. We build upon this work and introduce MUTED, a system to identify multilingual HAP content by displaying offensive arguments and their targets using heat maps to indicate their intensity. MUTED can leverage any transformer-based HAP-classification model and its attention mechanism out-of-the-box to identify toxic spans, without further fine-tuning. In addition, we use the spaCy library to identify the specific targets and arguments for the words predicted by the attention heat maps. We present the model’s performance on identifying offensive spans and their targets in existing datasets and present new annotations on German text. Finally, we demonstrate our proposed visualization tool on multilingual inputs.
2014
Automatic dialect classification for statistical machine translation
Saab Mansour | Yaser Al-Onaizan | Graeme Blackwood | Christoph Tillmann
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track
The training data for statistical machine translation are gathered from various sources representing a mixture of domains. In this work, we argue that when translating dialects representing varieties of the same language, a manually assigned data source is not a reliable indicator of the dialect. We resort to automatic dialect classification to refine the training corpora according to the different dialects and build improved dialect-specific systems. A fairly standard classifier for Arabic developed within this work achieves state-of-the-art performance, with classification precision above 90%, making it usefully accurate for our application. The classification of the data is then used to distinguish between the different dialects, split the data accordingly, and utilize the new splits for several adaptation techniques. Performing translation experiments on a large-scale dialectal Arabic-to-English translation task, our results show that the classifier generates better contrast between the dialects and achieves better translation quality than the original manual corpora splits.
Improved Sentence-Level Arabic Dialect Classification
Christoph Tillmann | Saab Mansour | Yaser Al-Onaizan
Proceedings of the First Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects
2009
A Simple Sentence-Level Extraction Algorithm for Comparable Data
Christoph Tillmann | Jian-ming Xu
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
A Beam-Search Extraction Algorithm for Comparable Data
Christoph Tillmann
Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
2008
A Rule-Driven Dynamic Programming Decoder for Statistical MT
Christoph Tillmann
Proceedings of the ACL-08: HLT Second Workshop on Syntax and Structure in Statistical Translation (SSST-2)
2006
A Discriminative Global Training Algorithm for Statistical MT
Christoph Tillmann | Tong Zhang
Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics
Efficient Dynamic Programming Search Algorithms for Phrase-Based SMT
Christoph Tillmann
Proceedings of the Workshop on Computationally Hard Problems and Joint Inference in Speech and Language Processing
2005
A Localized Prediction Model for Statistical Machine Translation
Christoph Tillmann | Tong Zhang
Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL’05)
2004
A Unigram Orientation Model for Statistical Machine Translation
Christoph Tillmann
Proceedings of HLT-NAACL 2004: Short Papers
2003
Word Reordering and a Dynamic Programming Beam Search Algorithm for Statistical Machine Translation
Christoph Tillmann | Hermann Ney
Computational Linguistics, Volume 29, Number 1, March 2003
A Phrase-based Unigram Model for Statistical Machine Translation
Christoph Tillmann | Fei Xia
Companion Volume of the Proceedings of HLT-NAACL 2003 - Short Papers
TIPS: A Translingual Information Processing System
Yaser Al-Onaizan | Radu Florian | Martin Franz | Hany Hassan | Young-Suk Lee | J. Scott McCarley | Kishore Papineni | Salim Roukos | Jeffrey Sorensen | Christoph Tillmann | Todd Ward | Fei Xia
Companion Volume of the Proceedings of HLT-NAACL 2003 - Demonstrations
A Projection Extension Algorithm for Statistical Machine Translation
Christoph Tillmann
Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing
2000
Word Re-ordering and DP-based Search in Statistical Machine Translation
Christoph Tillmann | Hermann Ney
COLING 2000 Volume 2: The 18th International Conference on Computational Linguistics
1999
A Statistical Parser for Czech
Michael Collins | Jan Hajic | Lance Ramshaw | Christoph Tillmann
Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics
Improved Alignment Models for Statistical Machine Translation
Franz Josef Och | Christoph Tillmann | Hermann Ney
1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora
1998
A DP based Search Algorithm for Statistical Machine Translation
S. Nießen | S. Vogel | H. Ney | C. Tillmann
COLING 1998 Volume 2: The 17th International Conference on Computational Linguistics
A DP based Search Algorithm for Statistical Machine Translation
S. Nießen | S. Vogel | H. Ney | C. Tillmann
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 2
1997
A DP-based Search Using Monotone Alignments in Statistical Translation
Christoph Tillmann | Stephan Vogel | Hermann Ney | Alex Zubiaga
35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics
Word Triggers and the EM Algorithm
Christoph Tillmann | Hermann Ney
CoNLL97: Computational Natural Language Learning
1996
Co-authors
- Hermann Ney 8
- Stephan Vogel 4
- Yaser Al-Onaizan 3
- Saab Mansour 2
- Sonja Nießen 2
- Fei Xia 2
- Tong Zhang 2
- Bishwaranjan Bhattacharjee 1
- Graeme Blackwood 1
- Santosh Borse 1
- Michael Collins 1
- Radu Florian 1
- Martin Franz 1
- Jan Hajic 1
- Hany Hassan Awadalla 1
- Young-Suk Lee 1
- J. Scott McCarley 1
- Franz Josef Och 1
- Kishore Papineni 1
- Lance Ramshaw 1
- Sara Rosenthal 1
- Salim Roukos 1
- Avirup Sil 1
- Jeffrey Sorensen 1
- Aashka Trivedi 1
- Todd Ward 1
- Jian-ming Xu 1
- Rong Zhang 1
- Alex Zubiaga 1