Carlo Tasso


2020

pdf
Keyphrase Generation with GANs in Low-Resources Scenarios
Giuseppe Lancioni | Saida S.Mohamed | Beatrice Portelli | Giuseppe Serra | Carlo Tasso
Proceedings of SustaiNLP: Workshop on Simple and Efficient Natural Language Processing

Keyphrase Generation is the task of predicting Keyphrases (KPs), short phrases that summarize the semantic meaning of a given document. Several past studies provided diverse approaches to generate Keyphrases for an input document. However, all of these approaches still need to be trained on very large datasets. In this paper, we introduce BeGanKP, a new conditional GAN model to address the problem of Keyphrase Generation in a low-resource scenario. Our main contribution relies in the Discriminator’s architecture: a new BERT-based module which is able to distinguish between the generated and humancurated KPs reliably. Its characteristics allow us to use it in a low-resource scenario, where only a small amount of training data are available, obtaining an efficient Generator. The resulting architecture achieves, on five public datasets, competitive results with respect to the state-of-the-art approaches, using less than 1% of the training data.

2018

pdf
Predicting the Usefulness of Amazon Reviews Using Off-The-Shelf Argumentation Mining
Marco Passon | Marco Lippi | Giuseppe Serra | Carlo Tasso
Proceedings of the 5th Workshop on Argument Mining

Internet users generate content at unprecedented rates. Building intelligent systems capable of discriminating useful content within this ocean of information is thus becoming a urgent need. In this paper, we aim to predict the usefulness of Amazon reviews, and to do this we exploit features coming from an off-the-shelf argumentation mining system. We argue that the usefulness of a review, in fact, is strictly related to its argumentative content, whereas the use of an already trained system avoids the costly need of relabeling a novel dataset. Results obtained on a large publicly available corpus support this hypothesis.

2017

pdf
Exploiting and Evaluating a Supervised, Multilanguage Keyphrase Extraction pipeline for under-resourced languages
Marco Basaldella | Muhammad Helmy | Elisa Antolli | Mihai Horia Popescu | Giuseppe Serra | Carlo Tasso
Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017

This paper evaluates different techniques for building a supervised, multilanguage keyphrase extraction pipeline for languages which lack a gold standard. Starting from an unsupervised English keyphrase extraction pipeline, we implement pipelines for Arabic, Italian, Portuguese, and Romanian, and we build test collections for languages which lack one. Then, we add a Machine Learning module trained on a well-known English language corpus and we evaluate the performance not only over English but on the other languages as well. Finally, we repeat the same evaluation after training the pipeline over an Arabic language corpus to check whether using a language-specific corpus brings a further improvement in performance. On the five languages we analyzed, results show an improvement in performance when using a machine learning algorithm, even if such algorithm is not trained and tested on the same language.

2016

pdf
Evaluating anaphora and coreference resolution to improve automatic keyphrase extraction
Marco Basaldella | Giorgia Chiaradia | Carlo Tasso
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers

In this paper we analyze the effectiveness of using linguistic knowledge from coreference and anaphora resolution for improving the performance for supervised keyphrase extraction. In order to verify the impact of these features, we define a baseline keyphrase extraction system and evaluate its performance on a standard dataset using different machine learning algorithms. Then, we consider new sets of features by adding combinations of the linguistic features we propose and we evaluate the new performance of the system. We also use anaphora and coreference resolution to transform the documents, trying to simulate the cohesion process performed by the human mind. We found that our approach has a slightly positive impact on the performance of automatic keyphrase extraction, in particular when considering the ranking of the results.

1991

pdf
Teaching the English Tense: Integrating Naive and Formal Grammars in an Intelligent Tutor for Foreign Language Teaching
Danilo Fum | Bruno Pani | Carlo Tasso
Fifth Conference of the European Chapter of the Association for Computational Linguistics

1989

pdf
Tense Generation in an Intelligent Tutor for Foreign Language Teaching: Some Issues in the Design of the Verb Expert
Danilo Fum | Paolo Giangrandi | Carlo Tasso
Fourth Conference of the European Chapter of the Association for Computational Linguistics

1988

pdf
A Distributed Multi-Agent Architecture for Natural Language Processing
Danilo Fum | Giovanni Guida | Carlo Tasso
Coling Budapest 1988 Volume 2: International Conference on Computational Linguistics

1986

pdf
Tailoring Importance Evaluation to Reader’s Goals: A Contribution to Descriptive Text Summarization
Danilo Fum | Giovanni Guida | Carlo Tasso
Coling 1986 Volume 1: The 11th International Conference on Computational Linguistics

1985

pdf
A Rule-Based Approach to Evaluating Importance in Descriptive Texts
Danilo Fum | Giovanni Guida | Carlo Tasso
Second Conference of the European Chapter of the Association for Computational Linguistics

1983

pdf
IR-NLI : An Expert Natural Language Interface to Online Data Bases
Giovanni Guida | Carlo Tasso
First Conference on Applied Natural Language Processing

1982

pdf
Forward and Backward Reasoning in Automatic Abstracting
Danilo Fum | Giovanni Guida | Carlo Tasso
Coling 1982: Proceedings of the Ninth International Conference on Computational Linguistics