Federico Boschetti
2026
Integrating Services, Platforms and Resources into a National Infrastructure Cluster for FAIR Language and Cultural Data
Giulia Pedonese | Daniele Melaccio | Michele Mallia | Monica Monachini | Francesca Frontini | Valeria Quochi | Fahad Khan | Angelo Mario Del Grosso | Federico Boschetti | Riccardo Del Gratta
Proceedings of the Fifteenth Language Resources and Evaluation Conference
In the context of evolving European and national policies for research infrastructure governance, this paper presents the contribution of a national consortium for language resources and technology to the construction of a national infrastructure for FAIR and interoperable language and cultural data within a broader Humanities and Heritage Open Science initiative. As the national node of a European research infrastructure for language resources, the consortium contributes to translating FAIR and Open Science principles into practice by integrating technical, methodological, and training dimensions. Its activities combine several coordinated components: FAIRification workflows and ontology-based metadata mediation to enhance semantic interoperability across infrastructures; the refactoring and exposure of services through a federated API gateway; and the implementation of a Linguistic Linked Open Data (LLOD) pilot for the validation, transformation, and publication of interoperable RDF datasets. A national training ecosystem — comprising a training platform and a FAIR learning library — supports capacity building and the creation of FAIR-by-design learning materials. Finally, a permanent research observatory monitors community practices and needs, providing evidence-based insights for the continuous improvement of services and training provision. Together, these components demonstrate a coherent strategy for implementing FAIR and Open Science at the national level, while ensuring alignment with major European and national initiatives in the SSH data ecosystem.
Automatic Suggestions of Supplements in the Herculaneum Papyri: Language Models and RESTful API
Angelo Mario Del Grosso | Gabriele Giannessi | Simone Zenzaro | Federico Boschetti
Proceedings of the Fifteenth Language Resources and Evaluation Conference
This paper addresses a computational philology task focused on the automatic restoration of textual gaps (i.e., lacunae) in the Herculaneum Papyri, whose Ancient Greek texts are inherently fragmentary due to damage caused by carbonization. The objective of this work is to show the preliminary results concerning the development of a web-based suggestion service for proposing plausible supplements to fill lacunae, thereby supporting the philological process of producing new critical editions within a new web-based digital scholarly editing environment. To automatically provide such suggestions, we have developed systems that generate textual supplements in Ancient Greek, employing both neural (BERT-like) and statistical (n-gram) language modeling approaches.
2025
Enhancing Lexical Resources: Synset Expansion and Cross-Linking Between ItalWordNet and MariTerm
Lucia Galiero | Federico Boschetti | Riccardo Del Gratta | Angelo Mario Del Grosso | Monica Monachini
Proceedings of the 13th Global Wordnet Conference
2023
Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023)
Federico Boschetti | Gianluca E. Lebani | Bernardo Magnini | Nicole Novielli
Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023)
Annotating Homeric Emotions by a Domain-Specific Language
Federico Boschetti | Laura Chilla | Maria Konstantinidou | John Pavlopoulos
Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023)
Preface to the CLiC-it 2023 Proceedings
Federico Boschetti | Gianluca E. Lebani | Bernardo Magnini | Nicole Novielli
Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023)
2020
“Voices of the Great War”: A Richly Annotated Corpus of Italian Texts on the First World War
Federico Boschetti | Irene De Felice | Stefano Dei Rossi | Felice Dell’Orletta | Michele Di Giorgio | Martina Miliani | Lucia C. Passaro | Angelica Puddu | Giulia Venturi | Nicola Labanca | Alessandro Lenci | Simonetta Montemagni
Proceedings of the Twelfth Language Resources and Evaluation Conference
“Voices of the Great War” is the first large corpus of Italian historical texts dating back to the period of the First World War. This corpus differs from other existing resources in several respects. First, from the linguistic point of view, it gives an account of the wide range of varieties in which Italian was articulated in that period, namely from the diastratic (educated vs. uneducated writers), diaphasic (low/informal vs. high/formal registers) and diatopic (regional varieties, dialects) points of view. Second, from the historical perspective, through a collection of texts belonging to different genres, it represents different views on the war and the various styles of narrating war events and experiences. The final corpus is balanced along various dimensions, corresponding to the textual genre, the language variety used, the author type and the typology of conveyed contents. The corpus is fully annotated with lemmas, parts of speech, terminology, and named entities. Significant corpus samples representative of the different “voices” have also been enriched with meta-linguistic and syntactic information. The layer of syntactic annotation forms the first nucleus of an Italian historical treebank complying with the Universal Dependencies standard. The paper illustrates the final resource, the methodology and tools used to build it, and the Web Interface for navigating it.
2019
Nove Anni di jTEI: What’s New? (Nine Years of jTEI: What’s New?)
Federico Boschetti | Gabriella Pardelli | Giulia Venturi
Proceedings of the Sixth Italian Conference on Computational Linguistics (CLiC-it 2019)
2017
Designing an Ontology for the Study of Ritual in Ancient Greek Tragedy
Gloria Mugelli | Andrea Bellandi | Federico Boschetti | Anas Fahad Khan
Proceedings of Language, Ontology, Terminology and Knowledge Structures Workshop (LOTKS 2017)
2016
Ancient Greek WordNet Meets the Dynamic Lexicon: the Example of the Fragments of the Greek Historians
Monica Berti | Yuri Bizzoni | Federico Boschetti | Gregory R. Crane | Riccardo Del Gratta | Tariq Yousef
Proceedings of the 8th Global WordNet Conference (GWC)
The Ancient Greek WordNet (AGWN) and the Dynamic Lexicon (DL) are multilingual resources to study the lexicon of Ancient Greek texts and their translations. Both AGWN and DL are works in progress that need accuracy improvement and manual validation. After a detailed description of the current state of each work, this paper illustrates a methodology to cross AGWN and DL data, in order to mutually score the items of each resource according to the evidence provided by the other resource. The training data is based on the corpus of the Digital Fragmenta Historicorum Graecorum (DFHG), which includes ancient Greek texts with Latin translations.
2014
The Making of Ancient Greek WordNet
Yuri Bizzoni | Federico Boschetti | Harry Diakoff | Riccardo Del Gratta | Monica Monachini | Gregory Crane
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
This paper describes the process of creating and reviewing a new lexico-semantic resource for classical studies: Ancient Greek WordNet. The candidate sets of synonyms (synsets) are extracted from Greek-English dictionaries, on the assumption that Greek words translated by the same English word or phrase have a high probability of being synonyms or at least semantically closely related. The process of validation and the web interface developed to edit and query the resource are described in detail. The lexical coverage of Ancient Greek WordNet is illustrated and the accuracy is evaluated. Finally, scenarios for exploiting the resource are discussed.
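The extraction assumption stated in this abstract (Greek words that share an English translation are likely to be related) can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `candidate_synsets`, the input format, and the toy entries (transliterated lemmas with simplified glosses) are all assumptions made for illustration.

```python
from collections import defaultdict


def candidate_synsets(dictionary):
    """Group lemmas that share an English gloss (hypothetical helper).

    `dictionary` maps each Greek lemma to a list of English glosses.
    Returns a dict from gloss to the sorted list of lemmas sharing it.
    """
    by_gloss = defaultdict(set)
    for lemma, glosses in dictionary.items():
        for gloss in glosses:
            # Normalise lightly so "House" and "house " fall together.
            by_gloss[gloss.strip().lower()].add(lemma)
    # A gloss attested for a single lemma yields no synonym candidates.
    return {g: sorted(ls) for g, ls in by_gloss.items() if len(ls) >= 2}


# Toy data, transliterated and simplified; not taken from the resource.
toy_dictionary = {
    "oikos": ["house", "dwelling"],
    "domos": ["house", "building"],
    "doma": ["House", "dwelling"],
    "hippos": ["horse"],
}

synsets = candidate_synsets(toy_dictionary)
for gloss, lemmas in sorted(synsets.items()):
    print(gloss, lemmas)
```

Candidates produced this way are only hypotheses, which is why the abstract pairs the extraction step with validation.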
2009
Co-authors
- Riccardo Del Gratta 4
- Gregory Crane 3
- Angelo Mario Del Grosso 3
- Monica Monachini 3
- Yuri Bizzoni 2
- Gianluca E. Lebani 2
- Bernardo Magnini 2
- Nicole Novielli 2
- Giulia Venturi 2
- Andrea Bellandi 1
- Monica Berti 1
- Laura Chilla 1
- Irene De Felice 1
- Felice Dell’Orletta 1
- Michele Di Giorgio 1
- Harry Diakoff 1
- Francesca Frontini 1
- Lucia Galiero 1
- Gabriele Giannessi 1
- Fahad Khan 1
- Fahad Khan 1
- Maria Konstantinidou 1
- Nicola Labanca 1
- Alessandro Lenci 1
- Michele Mallia 1
- Daniele Melaccio 1
- Martina Miliani 1
- Simonetta Montemagni 1
- Gloria Mugelli 1
- Gabriella Pardelli 1
- Lucia C. Passaro 1
- John Pavlopoulos 1
- Giulia Pedonese 1
- Angelica Puddu 1
- Valeria Quochi 1
- Matteo Romanello 1
- Stefano Dei Rossi 1
- Tariq Yousef 1
- Simone Zenzaro 1