Nuno Mamede

Also published as: Nuno J. Mamede


Support Verb Constructions across the Ocean Sea
Jorge Baptista | Nuno Mamede | Sónia Reis
Proceedings of the 18th Workshop on Multiword Expressions @LREC2022

This paper analyses the support (or light) verb constructions (SVC) in a publicly available, manually annotated corpus of multiword expressions (MWE) in Brazilian Portuguese. The paper highlights several issues in the linguistic definitions therein adopted for these types of MWE, and reports the results from applying STRING, a rule-based parsing system, originally developed for European Portuguese, to this corpus from Brazilian Portuguese. The goal is two-fold: to improve the linguistic definition of SVC in the annotation task, as well as to gauge the major difficulties found when transposing linguistic resources between these two varieties of the same language.


metaTED: a Corpus of Metadiscourse for Spoken Language
Rui Correia | Nuno Mamede | Jorge Baptista | Maxine Eskenazi
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)

This paper describes metaTED ― a freely available corpus of metadiscursive acts in spoken language collected via crowdsourcing. Metadiscursive acts were annotated on a set of 180 randomly chosen TED talks in English, spanning over different speakers and topics. The taxonomy used for annotation is composed of 16 categories, adapted from Adel(2010). This adaptation takes into account both the material to annotate and the setting in which the annotation task is performed. The crowdsourcing setup is described, including considerations regarding training and quality control. The collected data is evaluated in terms of quantity of occurrences, inter-annotator agreement, and annotation related measures (such as average time on task and self-reported confidence). Results show different levels of agreement among metadiscourse acts (α ∈ [0.15; 0.49]). To further assess the collected material, a subset of the annotations was submitted to expert appreciation, who validated which of the marked occurrences truly correspond to instances of the metadiscursive act at hand. Similarly to what happened with the crowd, experts revealed different levels of agreement between categories (α ∈ [0.18; 0.72]). The paper concludes with a discussion on the applicability of metaTED with respect to each of the 16 categories of metadiscourse.


Lexical Level Distribution of Metadiscourse in Spoken Language
Rui Correia | Maxine Eskenazi | Nuno Mamede
Proceedings of the First Workshop on Linking Computational Models of Lexical, Sentential and Discourse-level Semantics

Integrating support verb constructions into a parser
Amanda Rassi | Jorge Baptista | Nuno Mamede | Oto Vale
Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology


The fuzzy boundaries of operator verb and support verb constructions with dar “give” and ter “have” in Brazilian Portuguese
Amanda Rassi | Cristina Santos-Turati | Jorge Baptista | Nuno Mamede | Oto Vale
Proceedings of Workshop on Lexical and Grammatical Resources for Language Processing

Revising the annotation of a Broadcast News corpus: a linguistic approach
Vera Cabarrão | Helena Moniz | Fernando Batista | Ricardo Ribeiro | Nuno Mamede | Hugo Meinedo | Isabel Trancoso | Ana Isabel Mata | David Martins de Matos
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

This paper presents a linguistic revision process of a speech corpus of Portuguese broadcast news focusing on metadata annotation for rich transcription, and reports on the impact of the new data on the performance for several modules. The main focus of the revision process consisted on annotating and revising structural metadata events, such as disfluencies and punctuation marks. The resultant revised data is now being extensively used, and was of extreme importance for improving the performance of several modules, especially the punctuation and capitalization modules, but also the speech recognition system, and all the subsequent modules. The resultant data has also been recently used in disfluency studies across domains.


pdf bib
Language Dynamics and Capitalization using Maximum Entropy
Fernando Batista | Nuno Mamede | Isabel Trancoso
Proceedings of ACL-08: HLT, Short Papers

Using Lexical Acquisition to Enrich a Predicate Argument Reusable Database
Paula Cristina Vaz | David Martins de Matos | Nuno J. Mamede
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

The work described in this paper aims to enrich the noun classifications of an existing database of lexical resources (de Matos and Ribeiro, 2004) adding missing information such as semantic relations. Relations are extracted from an annotated and manually corrected corpus. Semantic relations added to the database are retrieved from noun-appositive relations found in the corpus. The method uses clustering to generate labeled sets of words with hypernym relations between set label and set elements.

Reengineering a Domain-Independent Framework for Spoken Dialogue Systems
Filipe M. Martins | Ana Mendes | Mácio Freitas Viveiros | Joana Paulo Pardal | Pedro Arez | Nuno J. Mamede | João Paulo Neto
Software Engineering, Testing, and Quality Assurance for Natural Language Processing


A Framework to Integrate Ubiquitous Knowledge Modeling
Porfírio Filipe | Nuno Mamede
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

This paper describes our contribution to let end users configure mixed-initiative spoken dialogue systems to suit their personalized goals. The main problem that we want to address is the reconfiguration of spoken language dialogue systems to deal with generic plug and play artifacts. Such reconfiguration can be seen as a portability problem and is a critical research issue. In order to solve this problem we describe a hybrid approach to design ubiquitous domain models that allows the dialogue system to perform recognition of available tasks on the fly. Our approach considers two kinds of domain knowledge: the global knowledge and the local knowledge. The global knowledge, that is modeled using a top-down approach, is associated at design time with the dialogue system itself. The local knowledge, that is modeled using a bottom-up approach, is defined with each one of the artifacts. When an artifact is activated or deactivated, a bilateral process, supported by a broker, updates the domain knowledge considering the artifact local knowledge. We assume that everyday artifacts are augmented with computational capabilities and semantic descriptions supported by their own knowledge model. A case study focusing a microwave oven is depicted.


Rethinking Reusable Resources
David M. de Matos | Ricardo Ribeiro | Nuno J. Mamede
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

From a Surface Analysis to a Dependency Structure
Luisa Coheur | Nuno Mamede | Gabriel G. Bes
Proceedings of the Workshop on Recent Advances in Dependency Grammar

A step towards incremental generation of logical forms
Luísa Coheur | Nuno Mamede | Gabriel Bès
Proceedings of the 3rd workshop on RObust Methods in Analysis of Natural Language Data (ROMAND 2004)