John A. Carroll - ACL Anthology

John A. Carroll

Cambridge, Sussex

Also published as: John Carroll

Other people with similar names: John B. Carroll (UNC)

2020

English Recipe Flow Graph Corpus
Yoko Yamakata | Shinsuke Mori | John Carroll
Proceedings of the Twelfth Language Resources and Evaluation Conference

We present an annotated corpus of English cooking recipe procedures, and describe and evaluate computational methods for learning these annotations. The corpus consists of 300 recipes written by members of the public, which we have annotated with domain-specific linguistic and semantic structure. Each recipe is annotated with (1) ‘recipe named entities’ (r-NEs) specific to the recipe domain, and (2) a flow graph representing in detail the sequencing of steps, and interactions between cooking tools, food ingredients and the products of intermediate steps. For these two kinds of annotations, inter-annotator agreement ranges from 82.3 to 90.5 F1, indicating that our annotation scheme is appropriate and consistent. We experiment with producing these annotations automatically. For r-NE tagging we train a deep neural network NER tool; to compute flow graphs we train a dependency-style parsing procedure which we apply to the entire sequence of r-NEs in a recipe. In evaluations, our systems achieve 71.1 to 87.5 F1, demonstrating that our annotation scheme is learnable.

2016

Using Linguistic Data for English and Spanish Verb-Noun Combination Identification
Uxoa Iñurrieta | Arantza Díaz de Ilarraza | Gorka Labaka | Kepa Sarasola | Itziar Aduriz | John Carroll
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers

We present a linguistic analysis of a set of English and Spanish verb+noun combinations (VNCs), and a method to use this information to improve VNC identification. Firstly, a sample of frequent VNCs is analysed in depth and tagged along lexico-semantic and morphosyntactic dimensions, obtaining satisfactory inter-annotator agreement scores. Then, a VNC identification experiment is undertaken, where the analysed linguistic data is combined with chunking information and syntactic dependencies. A comparison between the results of the experiment and the results obtained by a basic detection method shows that VNC identification can be greatly improved by using linguistic information, as a large number of additional occurrences are detected with high precision.

2014

Learning to Predict Distributions of Words Across Domains
Danushka Bollegala | David Weir | John Carroll
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Chunking Clinical Text Containing Non-Canonical Language
Aleksandar Savkov | John Carroll | Jackie Cassell
Proceedings of BioNLP 2014

2013

Induction of Root and Pattern Lexicon for Unsupervised Morphological Analysis of Arabic
Bilal Khaliq | John Carroll
Proceedings of the Sixth International Joint Conference on Natural Language Processing

Unsupervised Induction of Arabic Root and Pattern Lexicons using Machine Learning
Bilal Khaliq | John Carroll
Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013

2011

Dependency Parsing Schemata and Mildly Non-Projective Dependency Parsing
Carlos Gómez-Rodríguez | John Carroll | David Weir
Computational Linguistics, Volume 37, Issue 3 - September 2011

Using Multiple Sources to Construct a Sentiment Sensitive Thesaurus for Cross-Domain Sentiment Classification
Danushka Bollegala | David Weir | John Carroll
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies

2010

Book Review: Dependency Parsing by Sandra Kübler, Ryan McDonald, and Joakim Nivre
John Carroll
Computational Linguistics, Volume 36, Number 1, March 2010

2009

Parsing Mildly Non-Projective Dependency Structures
Carlos Gómez-Rodríguez | David Weir | John Carroll
Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009)

Estimating and Exploiting the Entropy of Sense Distributions
Peng Jin | Diana McCarthy | Rob Koeling | John Carroll
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers

2008

Automatic Seed Word Selection for Unsupervised Sentiment Classification of Chinese Text
Taras Zagibalov | John Carroll
Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)

Unsupervised Classification of Sentiment and Objectivity in Chinese Text
Taras Zagibalov | John Carroll
Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-I

The BNC Parsed with RASP4UIMA
Øistein E. Andersen | Julien Nioche | Ted Briscoe | John Carroll
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

We have integrated the RASP system with the UIMA framework (RASP4UIMA) and used this to parse the XML-encoded version of the British National Corpus (BNC). All original annotation is preserved, and parsing information, mainly in the form of grammatical relations, is added in an XML format. A few specific adaptations of the system to give better results with the BNC are discussed briefly. The RASP4UIMA system is publicly available and can be used to parse other corpora or document collections, and the final parsed version of the BNC will be deposited with the Oxford Text Archive.

A Deductive Approach to Dependency Parsing
Carlos Gómez-Rodríguez | John Carroll | David Weir
Proceedings of ACL-08: HLT

2007

Unsupervised Acquisition of Predominant Word Senses
Diana McCarthy | Rob Koeling | Julie Weeds | John Carroll
Computational Linguistics, Volume 33, Number 4, December 2007

Annotating Expressions of Appraisal in English
Jonathon Read | David Hope | John Carroll
Proceedings of the Linguistic Annotation Workshop

Semi-supervised Training of a Statistical Parser from Unlabeled Partially-bracketed Data
Rebecca Watson | Ted Briscoe | John Carroll
Proceedings of the Tenth International Conference on Parsing Technologies

Efficiency in Unification-Based N-Best Parsing
Yi Zhang | Stephan Oepen | John Carroll
Proceedings of the Tenth International Conference on Parsing Technologies

Modelling control in generation
Roger Evans | David Weir | John Carroll | Daniel Paiva | Anja Belz
Proceedings of the Eleventh European Workshop on Natural Language Generation (ENLG 07)

2006

Evaluating the Accuracy of an Unlexicalized Statistical Parser on the PARC DepBank
Ted Briscoe | John Carroll
Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions

The Second Release of the RASP System
Ted Briscoe | John Carroll | Rebecca Watson
Proceedings of the COLING/ACL 2006 Interactive Presentation Sessions

2005

Domain-Specific Sense Distributions and Predominant Sense Acquisition
Rob Koeling | Diana McCarthy | John Carroll
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing

Word Sense Disambiguation Using Sense Examples Automatically Acquired from a Second Language
Xinglong Wang | John Carroll
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing

High Efficiency Realization for a Wide-Coverage Unification Grammar
John Carroll | Stephan Oepen
Second International Joint Conference on Natural Language Processing: Full Papers

Efficient Extraction of Grammatical Relations
Rebecca Watson | John Carroll | Ted Briscoe
Proceedings of the Ninth International Workshop on Parsing Technology

2004

Som å kapp-ete med trollet? – Towards MRS-based Norwegian-English machine translation
Stephan Oepen | Helge Dyvik | Jan Tore Lønning | Erik Velldal | Dorothee Beerman | John Carroll | Dan Flickinger | Lars Hellan | Janne Bondi Johannessen | Paul Meurer | Torbjørn Nordgård | Victoria Rosén
Proceedings of the 10th Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages

Automatic Identification of Infrequent Word Senses
Diana McCarthy | Rob Koeling | Julie Weeds | John Carroll
COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics

Cross-Language Acquisition of Semantic Models for Verbal Predicates
Jordi Atserias | Bernardo Magnini | Octavian Popescu | Eneko Agirre | Aitziber Atutxa | German Rigau | John Carroll | Rob Koeling
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

Finding Predominant Word Senses in Untagged Text
Diana McCarthy | Rob Koeling | Julie Weeds | John Carroll
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL-04)

Using automatically acquired predominant senses for Word Sense Disambiguation
Diana McCarthy | Rob Koeling | Julie Weeds | John Carroll
Proceedings of SENSEVAL-3, the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text

2003

Disambiguating Nouns, Verbs, and Adjectives Using Automatically Acquired Selectional Preferences
Diana McCarthy | John Carroll
Computational Linguistics, Volume 29, Number 4, December 2003

Detecting a Continuum of Compositionality in Phrasal Verbs
Diana McCarthy | Bill Keller | John Carroll
Proceedings of the ACL 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment

2002

High Precision Extraction of Grammatical Relations
John Carroll | Ted Briscoe
COLING 2002: The 19th International Conference on Computational Linguistics

Robust Accurate Statistical Annotation of General Text
Ted Briscoe | John Carroll
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)

MEANING: a Roadmap to Knowledge Technologies
German Rigau | Bernardo Magnini | Eneko Agirre | Piek Vossen | John Carroll
COLING-02: A Roadmap for Computational Linguistics

Evaluation of LTAG Parsing with Supertag Compaction
Olga Shaumyan | John Carroll | David Weir
Proceedings of the Sixth International Workshop on Tree Adjoining Grammar and Related Frameworks (TAG+6)

2001

Book Reviews: Robustness in Language and Speech Technology
John Carroll
Computational Linguistics, Volume 27, Number 4, December 2001

From RAGS to RICHES: Exploiting the Potential of a Flexible Generation Architecture
Lynne Cahill | John Carroll | Roger Evans | Daniel Paiva | Richard Power | Donia Scott | Kees van Deemter
Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics

Disambiguating Noun and Verb Senses Using Automatically Acquired Selectional Preferences
Diana McCarthy | John Carroll | Judita Preiss
Proceedings of SENSEVAL-2 Second International Workshop on Evaluating Word Sense Disambiguation Systems

Using an Open-Source Unification-Based System for CL/NLP Teaching
Anne Copestake | John Carroll | Dan Flickinger | Robert Malouf | Stephan Oepen
Proceedings of the ACL 2001 Workshop on Sharing Tools and Resources

2000

Ambiguity Packing in Constraint-based Parsing Practical Results
Stephan Oepen | John Carroll
1st Meeting of the North American Chapter of the Association for Computational Linguistics

Robust, applied morphological generation
Guido Minnen | John Carroll | Darren Pearce
INLG’2000 Proceedings of the First International Conference on Natural Language Generation

Proceedings of the COLING-2000 Workshop on Efficiency In Large-Scale Parsing Systems
John Carroll | Robert C. Moore | Stephan Oepen
Proceedings of the COLING-2000 Workshop on Efficiency In Large-Scale Parsing Systems

Efficient Large-Scale Parsing – a Survey
John Carroll | Stephan Oepen
Proceedings of the COLING-2000 Workshop on Efficiency In Large-Scale Parsing Systems

Engineering a Wide-Coverage Lexicalized Grammar
John Carroll | Nicolas Nicolov | Olga Shaumyan | Martine Smets | David Weir
Proceedings of the Fifth International Workshop on Tree Adjoining Grammar and Related Frameworks (TAG+5)

1999

Parsing with an Extended Domain of Locality
John Carroll | Nicolas Nicolov | Olga Shaumyan | Martine Smets | David Weir
Ninth Conference of the European Chapter of the Association for Computational Linguistics

Simplifying Text for Language-Impaired Readers
John Carroll | Guido Minnen | Darren Pearce | Yvonne Canning | Siobhan Devlin | John Tait
Ninth Conference of the European Chapter of the Association for Computational Linguistics

A Bag of Useful Techniques for Efficient and Robust Parsing
Bernd Kiefer | Hans-Ulrich Krieger | John Carroll | Rob Malouf
Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics

1998

The LexSys project
John Carroll | Nicolas Nicolov | Olga Shaumyan | Martine Smets | David Weir
Proceedings of the Fourth International Workshop on Tree Adjoining Grammars and Related Frameworks (TAG+4)

Can Subcategorisation Probabilities Help a Statistical Parser?
John Carroll | Guido Minnen | Ted Briscoe
Sixth Workshop on Very Large Corpora

1997

Encoding Frequency Information in Lexicalized Grammars
John Carroll | David Weir
Proceedings of the Fifth International Workshop on Parsing Technologies

We address the issue of how to associate frequency information with lexicalized grammar formalisms, using Lexicalized Tree Adjoining Grammar as a representative framework. We consider systematically a number of alternative probabilistic frameworks, evaluating their adequacy from both a theoretical and empirical perspective using data from existing large treebanks. We also propose three orthogonal approaches for backing off probability estimates to cope with the large number of parameters involved.

Automatic Extraction of Subcategorization from Corpora
Ted Briscoe | John Carroll
Fifth Conference on Applied Natural Language Processing

Book Reviews: Industrial Parsing of Software Manuals
John Carroll
Computational Linguistics, Volume 23, Number 4, December 1997

1996

Apportioning Development Effort in a Probabilistic LR Parsing System Through Evaluation
John Carroll | Ted Briscoe
Conference on Empirical Methods in Natural Language Processing

1995

Developing and Evaluating a Probabilistic LR Parser of Part-of-Speech and Punctuation Labels
Ted Briscoe | John Carroll
Proceedings of the Fourth International Workshop on Parsing Technologies

We describe an approach to robust domain-independent syntactic parsing of unrestricted naturally-occurring (English) input. The technique involves parsing sequences of part-of-speech and punctuation labels using a unification-based grammar coupled with a probabilistic LR parser. We describe the coverage of several corpora using this grammar and report the results of a parsing experiment using probabilities derived from bracketed training data. We report the first substantial experiments to assess the contribution of punctuation to deriving an accurate syntactic analysis, by parsing identical texts both with and without naturally-occurring punctuation marks.

1994

Relating Complexity to Practical Performance in Parsing With Wide-Coverage Unification Grammars
John Carroll
32nd Annual Meeting of the Association for Computational Linguistics

1993

Generalized Probabilistic LR Parsing of Natural Language (Corpora) with Unification-Based Grammars
Ted Briscoe | John Carroll
Computational Linguistics, Volume 19, Number 1, March 1993, Special Issue on Using Large Corpora: I

1992

A Practical Approach to Multiple Default Inheritance for Unification-Based Lexicons
Graham Russell | Afzal Ballim | John Carroll | Susan Warwick-Armstrong
Computational Linguistics, Volume 18, Number 3, September 1992, Special Issue on Inheritance: II

1991

Multiple Default Inheritance in a Unification-Based Lexicon
Graham Russell | John Carroll | Susan Warwick-Armstrong
29th Annual Meeting of the Association for Computational Linguistics

1990

Asymmetry in Parsing and Generating with Unification Grammars: Case Studies From ELU
Graham Russell | John Carroll | Susan Warwick
28th Annual Meeting of the Association for Computational Linguistics

1988

Software Support for Practical Grammar Development
Bran Boguraev | John Carroll | Ted Briscoe | Claire Grover
Coling Budapest 1988 Volume 1: International Conference on Computational Linguistics

1987

The Derivation of a Grammatically Indexed Lexicon from the Longman Dictionary of Contemporary English
Bran Boguraev | Ted Briscoe | John Carroll | David Carter | Claire Grover
25th Annual Meeting of the Association for Computational Linguistics

1983

An Island Parsing Interpreter for the Full Augmented Transition Network Formalism
John A. Carroll
First Conference of the European Chapter of the Association for Computational Linguistics

Co-authors

Olga Shaumyan 4

Susan Armstrong 3

Dan Flickinger 3

Carlos Gómez-Rodríguez 3

Nicolas Nicolov 3

Graham Russell 3

Martine Smets 3

Rebecca Watson 3

Branimir Boguraev 2

Danushka Bollegala 2

Claire Grover 2

Jan Tore Lønning 2

Bernardo Magnini 2

Robert Malouf 2

Darren Pearce 2

Taras Zagibalov 2

Itziar Aduriz 1

Øistein E. Andersen 1

Jordi Atserias 1

Aitziber Atutxa 1

Dorothee Beerman 1

Yvonne Canning 1

Jackie Cassell 1

Stephen Clark 1

Anne Copestake 1

Ann Copestake 1

Siobhan Devlin 1

Arantza Díaz de Ilarraza 1

Julia Hockenmaier 1

Uxoa Iñurrieta 1

Janne Bondi Johannessen 1

Aravind Joshi 1

Ronald M. Kaplan 1

Tracy Holloway King 1

Hans-Ulrich Krieger 1

Sandra Kübler 1

Christopher D. Manning 1

Robert C. Moore 1

Shinsuke Mori 1

Julien Nioche 1

Torbjørn Nordgård 1

Octavian Popescu 1

Richard Power 1

Judita Preiss 1

Jonathon Read 1

Victoria Rosén 1

Kepa Sarasola 1

Aleksandar Savkov 1

Xinglong Wang 1

Yoko Yamakata 1

Kees van Deemter 1

Josef van Genabith 1

Venues