Branimir Boguraev

Also published as: Bran Boguraev, Branimir K. Boguraev, B.K. Boguraev

2015

Learning Structures of Negations from Flat Annotations
Vinodkumar Prabhakaran | Branimir Boguraev
Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics

pdf bib

The Bare Necessities: Increasing Lexical Coverage for Multi-Word Domain Terms with Less Lexical Data
Branimir Boguraev | Esme Manandise | Benjamin Segal
Proceedings of the 11th Workshop on Multiword Expressions

2008

pdf bib abs

Navigating through Dense Annotation Spaces
Branimir Boguraev | Mary Neff
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

Pattern matching, or querying, over annotations is a general purpose paradigm for inspecting, navigating, mining, and transforming annotation repositories - the common representation basis for modern pipelined text-processing frameworks. Configurability of such frameworks and expressiveness of feature structure-based annotation schemes account for the high density of some such annotation repositories. This particular characteristic makes challenging the design of a pattern matching engine, capable of interpreting (or imposing) flat patterns over an arbitrarily dense annotation lattice. We present an approach where a finite state device carries out the application of (compiled) grammars over what is, in effect, a linearized projection of a unique route through the lattice; a route derived by a mix of static pattern (grammar) analysis and interpretation of navigational directives within the extended grammar formalism. Our approach achieves a mix of finite state scanning and lattice traversal for expressive and efficient pattern matching in dense annotations stores.

pdf bib abs

A Development Environment for Configurable Meta-Annotators in a Pipelined NLP Architecture
Youssef Drissi | Branimir Boguraev | David Ferrucci | Paul Keyser | Anthony Levas
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

Information extraction from large data repositories is critical to Information Management solutions. In addition to prerequisite corpus analysis, to determine domain-specific characteristics of text resources, developing, refining and evaluating analytics entails a complex and lengthy process, typically requiring more than just domain expertise. Modern architectures for text processing, while facilitating reuse and (re-)composition of analytical pipelines, do place additional constraints upon the analytics development, as domain experts need not only configure individual annotator components, but situate these within a fully functional annotator pipeline. We present the design, and current status, of a tool for configuring model-driven annotators, which abstracts away from annotator implementation details, pipeline composition constraints, and data management. Instead, the tool embodies support for all stages of ontology-centric model development cycle from corpus analysis and concept definition, to model development and testing, to large scale evaluation, to easy and rapid composition of text applications deploying these concept models. With our design, we aim to meet the needs of domain experts, who are not necessarily expert NLP practitioners.

2007

pdf bib

2006

pdf bib abs

Analysis of TimeBank as a Resource for TimeML Parsing
Branimir Boguraev | Rie Kubota Ando
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

In our work, we present an analysis of the TimeBank corpus---the only available reference sample of TimeML-compliant annotation---from the point of view of its utility as a training resource for developing automated TimeML annotators. We are encouraged by experimental results indicative of the potential of TimeBank; at the same time, closer inspection of causes for some systematic errors shows off certain deficiencies in the corpus, primarily to do with small size and inconsistent annotation. Our analysis suggests that even a reference resource, developed outside of a rigorous process of training corpus design and creation, can be extremely valuable for training and development purposes. The analysis also highlights areas of correction and improvement for evolving the current reference corpus into a community infrastructure resource.

pdf bib

Proceedings of the Workshop on Annotating and Reasoning about Time and Events
Branimir Boguraev | Rafael Muñoz | James Pustejovsky
Proceedings of the Workshop on Annotating and Reasoning about Time and Events