John Lehmann


2006

pdf
What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic
Luke Nezda | Andrew Hickl | John Lehmann | Sarmad Fayyaz
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

This paper describes the development of CiceroArabic, the first wide coverage named entity recognition (NER) system for Modern Standard Arabic. Capable of classifying 18 different named entity classes with over 85% F, CiceroArabic utilizes a new 800,000-word annotated Arabic newswire corpus in order to achieve high performance without the need for hand-crafted rules or morphological information. In addition to describing results from our system, we show that accurate named entity annotation for a large number of semantic classes is feasible, even for very large corpora, and we discuss new techniques designed to boost agreement and consistency among annotators over a long-term annotation effort.

pdf
FERRET: Interactive Question-Answering for Real-World Environments
Andrew Hickl | Patrick Wang | John Lehmann | Sanda Harabagiu
Proceedings of the COLING/ACL 2006 Interactive Presentation Sessions

2005

pdf
Experiments with Interactive Question-Answering
Sanda Harabagiu | Andrew Hickl | John Lehmann | Dan Moldovan
Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL’05)

2004

pdf
Experiments with Interactive Question Answering in Complex Scenarios
Andrew Hickl | John Lehmann | John Williams | Sanda Harabagiu
Proceedings of the Workshop on Pragmatics of Question Answering at HLT-NAACL 2004