Rodolfo Delmonte - ACL Anthology

This is an internal, incomplete preview of a proposed change to the ACL Anthology. For efficiency reasons, we don't generate MODS or Endnote formats, and the preview may be incomplete in other ways, or contain mistakes. Do not treat this content as an official publication.

Rodolfo Delmonte

Also published as: R. Delmonte

2024

La Non Canonica L’hai Studiata? Exploring LLMs and Sentence Canonicity in Italian
Claudiu Daniel Hromei | Danilo Croce | Rodolfo Delmonte | Roberto Basili
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024)

This paper investigates the ability of Large Language Models (LLMs) to differentiate between canonical and non-canonical sentences in Italian, employing advanced neural architectures like LLaMA and its adaptations. Canonical sentences adhere to the standard Subject-Verb-Object (SVO) structure. We hypothesize that recent generative LLMs are influenced heavily by the English language, where non-canonical structures are very rare. Using the in-context learning technique, we probe these models and further fine-tune them for this specific task. Initial results indicate that these models continue to struggle with this task even after fine-tuning. Additionally, we introduce a new dataset comprising several hundred sentences from the poetry domain, which presents significant challenges for the canonical structure task.

2023

When the Lab of CompLing Was Started at the University of Venice - Preface to the Proceedings of the First Workshop Held in 1982
Rodolfo Delmonte
Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023)

2022

Measuring Similarity by Linguistic Features rather than Frequency
Rodolfo Delmonte | Nicolò Busetto
Proceedings of the 18th Joint ACL - ISO Workshop on Interoperable Semantic Annotation within LREC2022

In the use and creation of current Deep Learning Models the only number that is used for the overall computation is the frequency value associated with the current word form in the corpus, which is used to substitute it. Frequency values come in two forms: absolute and relative. Absolute frequency is used indirectly when selecting the vocabulary against which the word embeddings are created: the cutoff threshold is usually fixed at 30/50K entries of the most frequent words. Relative frequency comes in directly when computing word embeddings based on co-occurrence values of the tokens included in a window size 2/5 adjacent tokens. The latter values are then used to compute similarity, mostly based on cosine distance. In this paper we will evaluate the impact of these two frequency parameters on a small corpus of Italian sentences whose main features are two: presence of very rare words and of non-canonical structures. Rather than basing our evaluation on cosine measure alone, we propose a graded scale of scores which are linguistically motivated. The results computed on the basis of a perusal of BERT’s raw embeddings shows that the two parameters conspire to decide the level of predictability.

2021

Converting Multilayer Glosses into Semantic and Pragmatic forms with GENLIS
Rodolfo Delmonte | Serena Trolvi | Francesco Stiffoni
Proceedings of the 17th Joint ACL - ISO Workshop on Interoperable Semantic Annotation

This paper presents work carried out to transform glosses of a fable in Italian Sign Language (LIS) into a text which is then read by a TTS synthesizer from an SSML modified version of the same text. Whereas many systems exist that generate sign language from a text, we decided to do the reverse operation and generate text from LIS. For that purpose we used a version of the fable The Tortoise and the Hare, signed and made available on Youtube by ALBA cooperativa sociale, which was annotated manually by second author for her master’s thesis. In order to achieve our goal, we converted the multilayer glosses into linear Prolog terms to be fed to the generator. In the paper we focus on the main problems encountered in the transformation of the glosses into a semantically and pragmatically consistent representation. The main problems have been caused by the complexities of a text like a fable which requires coreference mechanisms and speech acts to be implemented in the representation which are often unexpressed and constitute implicit information.

2020

Annotating a Fable in Italian Sign Language (LIS)
Serena Trolvi | Rodolfo Delmonte
Proceedings of the Twelfth Language Resources and Evaluation Conference

This paper introduces work carried out for the automatic generation of a written text in Italian starting from glosses of a fable in Italian Sign Language (LIS). The paper gives a brief overview of sign languages (SLs) and some peculiarities of SL fables such as the use of space, the strategy of Role Shift and classifiers. It also presents the annotation of the fable “The Tortoise and the Hare” - signed in LIS and made available by Alba Cooperativa Sociale -, which was annotated manually by first author for her master’s thesis. The annotation was the starting point of a generation process that allowed us to automatically generate a text in Italian starting from LIS glosses. LIS sentences have been transcribed with Italian words into tables on simultaneous layers, each of which contains specific linguistic or non-linguistic pieces of information. In addition, the present work discusses problems encountered in the annotation and generation process.

2019

Annotating Shakespeare’s Sonnets with Appraisal Theory to Detect Irony
Nicolò Busetto | Rodolfo Delmonte
Proceedings of the Sixth Italian Conference on Computational Linguistics (CLiC-it 2019)

Detecting Irony in Shakespeare’s Sonnets with SPARSAR
Rodolfo Delmonte | Nicolò Busetto
Proceedings of the Sixth Italian Conference on Computational Linguistics (CLiC-it 2019)

2017

A Semantically-Based Computational Approach to Narrative Structure
Rodolfo Delmonte | Giulia Marchesini
Proceedings of the 12th International Conference on Computational Semantics (IWCS) — Short papers

A semantically-based approach to the annotation of narrative style
Rodolfo Delmonte | Giulia Marchesi
Proceedings of the 13th Joint ISO-ACL Workshop on Interoperable Semantic Annotation (ISA-13)

2016

VENSESEVAL at Semeval-2016 Task 2 iSTS - with a full-fledged rule-based approach
Rodolfo Delmonte
Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)

Syntactic and Lexical Complexity in Italian Noncanonical Structures
Rodolfo Delmonte
Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity (CL4LC)

In this paper we will be dealing with different levels of complexity in the processing of Italian, a Romance language inheriting many properties from Latin which make it an almost free word order language . The paper is concerned with syntactic complexity as measurable on the basis of the cognitive parser that incrementally builds up a syntactic representation to be used by the semantic component. The theory behind will be LFG and parsing preferences will be used to justify one choice both from a principled and a processing point of view. LFG is a transformationless theory in which there is no deep structure separate from surface syntactic structure. This is partially in accordance with constructional theories in which noncanonical structures containing non-argument functions FOCUS/TOPIC are treated as multifunctional constituents. Complexity is computed on a processing basis following suggestions made by Blache and demonstrated by Kluender and Chesi

2015

Visualizing Poetry with SPARSAR – Visual Maps from Poetic Content
Rodolfo Delmonte
Proceedings of the Fourth Workshop on Computational Linguistics for Literature

Semantics and Discourse Processing for Expressive TTS
Rodolfo Delmonte | Rocco Tripodi
Proceedings of the First Workshop on Linking Computational Models of Lexical, Sentential and Discourse-level Semantics

2014

SPARSAR: An Expressive Poetry Reader
Rodolfo Delmonte | Anton Maria Prati
Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics

2013

Coping With Implicit Arguments And Events Coreference
Rodolfo Delmonte
Workshop on Events: Definition, Detection, Coreference, and Representation

2011

Une Suite d’interaction de fouille basée sur la compréhension du langage naturel (An Interaction Mining Suite Based On Natural Language Understanding)
Rodolfo Delmonte | Vincenzo Pallotta | Violeta Seretan | Lammert Vrieling | David Walker
Actes de la 18e conférence sur le Traitement Automatique des Langues Naturelles. Démonstrations

Desperately Seeking Implicit Arguments in Text
Sara Tonelli | Rodolfo Delmonte
Proceedings of the ACL 2011 Workshop on Relational Models of Semantics

2010

Deep Linguistic Processing with GETARUNS for Spoken Dialogue Understanding
Rodolfo Delmonte | Antonella Bristot | Vincenzo Pallotta
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

In this paper we will present work carried out to scale up the system for text understanding called GETARUNS, and port it to be used in dialogue understanding. The current goal is that of extracting automatically argumentative information in order to build argumentative structure. The long term goal is using argumentative structure to produce automatic summarization of spoken dialogues. Very much like other deep linguistic processing systems, our system is a generic text/dialogue understanding system that can be used in connection with an ontology ― WordNet - and other similar repositories of commonsense knowledge. We will present the adjustments we made in order to cope with transcribed spoken dialogues like those produced in the ICSI Berkeley project. In a final section we present preliminary evaluation of the system on two tasks: the task of automatic argumentative labeling and another frequently addressed task: referential vs. non-referential pronominal detection. Results obtained fair much higher than those reported in similar experiments with machine learning approaches.

VenPro: A Morphological Analyzer for Venetan
Sara Tonelli | Emanuele Pianta | Rodolfo Delmonte | Michele Brunelli
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

This document reports the process of extending MorphoPro for Venetan, a lesser-used language spoken in the Nort-Eastern part of Italy. MorphoPro is the morphological component of TextPro, a suite of tools oriented towards a number of NLP tasks. In order to extend this component to Venetan, we developed a declarative representation of the morphological knowledge necessary to analyze and synthesize Venetan words. This task was challenging for several reasons, which are common to a number of lesser-used languages: although Venetan is widely used as an oral language in everyday life, its written usage is very limited; efforts for defining a standard orthography and grammar are very recent and not well established; despite recent attempts to propose a unified orthography, no Venetan standard is widely used. Besides, there are different geographical varieties and it is strongly influenced by Italian.

VENSES++: Adapting a deep semantic processing system to the identification of null instantiations
Sara Tonelli | Rodolfo Delmonte
Proceedings of the 5th International Workshop on Semantic Evaluation

Opinion Mining, Subjectivity and Factuality
Rodolfo Delmonte
Proceedings of the Australasian Language Technology Association Workshop 2010

2009

Scaling up a NLU system from text to dialogue understanding
Rodolfo Delmonte | Antonella Bristot | Gloria Voltolina | Vincenzo Pallotta
Proceedings of the Workshop on Software Engineering, Testing, and Quality Assurance for Natural Language Processing (SETQA-NLP 2009)

Computing Implicit Entities and Events with Getaruns (short paper)
Rodolfo Delmonte | Emanuele Pianta
Proceedings of the Eight International Conference on Computational Semantics

2008

Reranking Google with GReG
Rodolfo Delmonte | Marco Aldo Piccolino Boniforti
Proceedings of the 4th Web as Corpus Workshop

We present an experiment evaluating the contribution of a system called GReG for reranking the snippets returned by Google’s search engine in the 10 best links presented to the user, captured by the use of Google’s API. The evaluation aims at establishing whether or not the introduction of deep linguistic information may improve the accuracy of Google or rather it is the opposite case as maintained by the majority of people working in Information Retrieval, using a Bag Of Words approach. We used 900 questions, answers taken from TREC 8, 9 competitions, execute three different types of evaluation: one without any linguistic aid; a second one with tagging, syntactic constituency contribution; another run with what we call Partial Logical Form. Even though GReG is still work in progress, it is possible to draw clearcut conclusions: adding linguistic information to the evaluation process of the best snippet that can answer a question improves enormously the performance. In another experiment we used the actual associated to the Q/A pairs distributed by one of TREC’s participant, got even higher accuracy.

Enriching the Venice Italian Treebank with Dependency and Grammatical Relations
Sara Tonelli | Rodolfo Delmonte | Antonella Bristot
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

In this paper we propose a rule-based approach to extract dependency and grammatical functions from the Venice Italian Treebank, a Treebank of written text with PoS and constituent labels consisting of 10,200 utterances and about 274,000 tokens. As manual corpus annotation is expensive and time-consuming, we decided to exploit this existing constituency-based Treebank to derive dependency structures with lower effort. After describing the procedure to extract heads and dependents, based on a head percolation table for Italian, we introduce the rules adopted to add grammatical relation labels. To this purpose, we manually relabeled all non-canonical arguments, which are very frequent in Italian, then we automatically labeled the remaining complements or arguments following some syntactic restrictions based on the position of the constituents w.r.t to parent and sibling nodes. The final section of the paper describes evaluation results. Evaluation was carried out in two steps, one for dependency relations and one for grammatical roles. Results are in line with similar conversion algorithms carried out for other languages, with 0.97 precision on dependency arcs and F-measure for the main grammatical functions scoring 0.96 or above, except for obliques with 0.75.

Semantics in Text Processing. STEP 2008 Conference Proceedings
Johan Bos | Rodolfo Delmonte
Semantics in Text Processing. STEP 2008 Conference Proceedings

Answering Why-Questions in Closed Domains from a Discourse Model
Rodolfo Delmonte | Emanuele Pianta
Semantics in Text Processing. STEP 2008 Conference Proceedings

Semantic and Pragmatic Computing with GETARUNS
Rodolfo Delmonte
Semantics in Text Processing. STEP 2008 Conference Proceedings

2007

Entailment and Anaphora Resolution in RTE3
Rodolfo Delmonte | Antonella Bristot | Marco Aldo Piccolino Boniforti | Sara Tonelli
Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing

2006

Hybrid Systems for Information Extraction and Question Answering
Rodolfo Delmonte
Proceedings of the Workshop on How Can Computational Linguistics Improve Information Retrieval?

Another Evaluation of Anaphora Resolution Algorithms and a Comparison with GETARUNS’ Knowledge Rich Approach
Rodolfo Delmonte | Antonella Bristot | Marco Aldo Piccolino Boniforti | Sara Tonelli
Proceedings of the Workshop on ROMAND 2006:Robust Methods in Analysis of Natural language Data

2004

Text Understanding with GETARUNS for Q/A and Summarization
Rodolfo Delmonte
Proceedings of the 2nd Workshop on Text Meaning and Interpretation

Evaluating GETARUNS parser with GREVAL test suite
Rodolfo Delmonte
Proceedings of the 3rd workshop on RObust Methods in Analysis of Natural Language Data (ROMAND 2004)

2002

Relative Clause Attachment and Anaphora: A Case for Short Binding
Rodolfo Delmonte
Proceedings of the Sixth International Workshop on Tree Adjoining Grammar and Related Frameworks (TAG+6)

2001

How to Integrate Linguistic information in FILES and generate feedback for grammar errors
R. Delmonte | L. Chiran | C. Bacalu
Proceedings of the ACL 2001 Workshop on Sharing Tools and Resources

2000

Generating from a discourse model
Rodolfo Delmonte | Dario Bianchi | Emanuele Pianta
Proceedings of the International Conference on Machine Translation and Multilingual Applications in the new Millennium: MT 2000

Shallow Parsing and Functional Structure in Italian Corpora
Rodolfo Delmonte
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)

Elementary trees for syntactic and statistical disambiguation
Rodolfo Delmonte | Luminita Chiran | Ciprian Bacalu
Proceedings of the Fifth International Workshop on Tree Adjoining Grammar and Related Frameworks (TAG+5)

1993

Undestanding Stories in Different Languages with GETA-RUN
Dario Bianchi | Rodolfo Delmonte | Emanuele Pianta
Sixth Conference of the European Chapter of the Association for Computational Linguistics

1985

Parsing Difficulties & Phonological Processing in Italian
Rodolfo Delmonte
Second Conference of the European Chapter of the Association for Computational Linguistics

1983

A Phonological Processor for Italian
Rodolfo Delmonte
First Conference of the European Chapter of the Association for Computational Linguistics