Stephan Busemann


2021

pdf
AVASAG: A German Sign Language Translation System for Public Services (short paper)
Fabrizio Nunnari | Judith Bauerdiek | Lucas Bernhard | Cristina España-Bonet | Corinna Jäger | Amelie Unger | Kristoffer Waldow | Sonja Wecker | Elisabeth André | Stephan Busemann | Christian Dold | Arnulph Fuhrmann | Patrick Gebhard | Yasser Hamidullah | Marcel Hauck | Yvonne Kossel | Martin Misiak | Dieter Wallach | Alexander Stricker
Proceedings of the 1st International Workshop on Automatic Translation for Signed and Spoken Languages (AT4SSL)

This paper presents an overview of AVASAG; an ongoing applied-research project developing a text-to-sign-language translation system for public services. We describe the scientific innovation points (geometry-based SL-description, 3D animation and video corpus, simplified annotation scheme, motion capture strategy) and the overall translation pipeline.

2020

pdf
Language Data Sharing in European Public Services – Overcoming Obstacles and Creating Sustainable Data Sharing Infrastructures
Lilli Smal | Andrea Lösch | Josef van Genabith | Maria Giagkou | Thierry Declerck | Stephan Busemann
Proceedings of the Twelfth Language Resources and Evaluation Conference

Data is key in training modern language technologies. In this paper, we summarise the findings of the first pan-European study on obstacles to sharing language data across 29 EU Member States and CEF-affiliated countries carried out under the ELRC White Paper action on Sustainable Language Data Sharing to Support Language Equality in Multilingual Europe. Why Language Data Matters. We present the methodology of the study, the obstacles identified and report on recommendations on how to overcome those. The obstacles are classified into (1) lack of appreciation of the value of language data, (2) structural challenges, (3) disposition towards CAT tools and lack of digital skills, (4) inadequate language data management practices, (5) limited access to outsourced translations, and (6) legal concerns. Recommendations are grouped into addressing the European/national policy level, and the organisational/institutional level.

2017

pdf bib
Common Round: Application of Language Technologies to Large-Scale Web Debates
Hans Uszkoreit | Aleksandra Gabryszak | Leonhard Hennig | Jörg Steffen | Renlong Ai | Stephan Busemann | Jon Dehdari | Josef van Genabith | Georg Heigold | Nils Rethmeier | Raphael Rubino | Sven Schmeier | Philippe Thomas | He Wang | Feiyu Xu
Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics

Web debates play an important role in enabling broad participation of constituencies in social, political and economic decision-taking. However, it is challenging to organize, structure, and navigate a vast number of diverse argumentations and comments collected from many participants over a long time period. In this paper we demonstrate Common Round, a next generation platform for large-scale web debates, which provides functions for eliciting the semantic content and structures from the contributions of participants. In particular, Common Round applies language technologies for the extraction of semantic essence from textual input, aggregation of the formulated opinions and arguments. The platform also provides a cross-lingual access to debates using machine translation.

2008

pdf
Identifying Foreign Person Names in Chinese Text
Stephan Busemann | Yajing Zhang
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

Foreign name expressions written in Chinese characters are difficult to recognize since the sequence of characters represents the Chinese pronunciation of the name. This paper suggests that known English or German person names can reliably be identified on the basis of the similarity between the Chinese and the foreign pronunciation. In addition to locating a person name in the text and learning that it is foreign, the corresponding foreign name is identified, thus gaining precious additional information for cross-lingual applications. This idea is implemented as a statistical module into the rule-based shallow parsing system SProUT, forming the HyFex system. The statistical component is invoked if a sequence of “trigger” characters is found that may correspond to a foreign name. Their phonetic Pinyin representation is produced and compared to the phonetic representations (SAMPA) of given foreign names, which are generated by the MARY TTS system for German and English pronunciations. This comparison is achieved by a hand-crafted metric that assigns costs to specific edit operations. The person name corresponding to the SAMPA representation with the lowest costs attached is returned as the most similar result, if a threshold is not exceeded. Our evaluation on publicly available data shows competitive results.

2007

pdf bib
Proceedings of the Eleventh European Workshop on Natural Language Generation (ENLG 07)
Stephan Busemann
Proceedings of the Eleventh European Workshop on Natural Language Generation (ENLG 07)

2006

pdf
Annotating text using the Linguistic Description Scheme of MPEG-7: The DIRECT-INFO Scenario
Thierry Declerck | Stephan Busemann | Herwig Rehatschek | Gert Kienast
Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-Dimensional Markup in Natural Language Processing

2005

pdf
Ten Years After: An Update on TG/2 (and Friends)
Stephan Busemann
Proceedings of the Tenth European Workshop on Natural Language Generation (ENLG-05)

2004

pdf
Resources and Techniques for Multilingual Information Extraction
Stephan Busemann | Hans-Ulrich Krieger
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

Official travel warnings published regularly in the internet by the ministries for foreign affairs of France, Germany, and the UK provide a useful resource for assessing the risks associated with travelling to some countries. The shallow IE system SProUT has been extended to meet the specific needs of delivering a language-neutral output for English, French, or German input texts. A shared type hierarchy, a feature-enhanced gazetteer resource, and generic techniques of merging chunk analyses into larger results are major reusable results of this work.

pdf
EGRAM – A Grammar Development Environment and its Usage for Language Generation
Stephan Busemann
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

2003

pdf
Integrating Information Extraction and Automatic Hyperlinking
Stephan Busemann | Witold Drozdzynski | Hans-Ulrich Krieger | Jakub Piskorski | Ulrich Schaefer | Hans Uszkoreit | Feiyu Xu
The Companion Volume to the Proceedings of 41st Annual Meeting of the Association for Computational Linguistics

2002

pdf
Multilingual Summarization by Integrating Linguistic Resources in the MLIS-MUSI Project
Alessandro Lenci | Roberto Bartolini | Nicoletta Calzolari | Ana Agua | Stephan Busemann | Emmanuel Cartier | Karine Chevreau | José Coch
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)

2000

pdf
Message Classification in the Call Center
Stephan Busemann | Sven Schmeier | Roman G. Arens
Sixth Applied Natural Language Processing Conference

1998

pdf
A Flexible Shallow Approach to Text Generation
Stephan Busemann | Helmut Horacek
Natural Language Generation

1997

pdf
Natural Language Dialogue Service for Appointment Scheduling Agents
Stephan Busemann | Thierry Declerck | Abdel Kader Diagne | Luca Dini | Judith Klein | Sven Schmeier
Fifth Conference on Applied Natural Language Processing

pdf
Automating NL Appointment Scheduling with COSMA
Stephan Busemann
Fifth Conference on Applied Natural Language Processing: Descriptions of System Demonstrations and Videos

1996

pdf
Best-First Surface Realization
Stephan Busemann
Eighth International Natural Language Generation Workshop

1994

pdf
DISCO-An HPSG-based NLP System and its Application for Appointment Scheduling Project Note
Hans Uszkoreit | Rolf Backofen | Stephan Busemann | Abdel Kader Diagne | Elizabeth A. Hinkelman | Walter Kasper | Bernd Kiefer | Hans-Ulrich Krieger | Klaus Netter | Gunter Neumann | Stephan Oepen | Stephen P. Spackman
COLING 1994 Volume 1: The 15th International Conference on Computational Linguistics

1991

pdf
Structure-Driven Generation From Separate Semantic Representations
Stephan Busemann
Fifth Conference of the European Chapter of the Association for Computational Linguistics

1988

pdf
A Constructive View of GPSG or How to Make It Work
Stephan Busemann | Christa Hauenschild
Coling Budapest 1988 Volume 1: International Conference on Computational Linguistics