Stefano Montanelli
2024
REAL: A Retrieval-Augmented Entity Linking Approach for Biomedical Concept Recognition
Darya Shlyk
|
Tudor Groza
|
Marco Mesiti
|
Stefano Montanelli
|
Emanuele Cavalleri
Proceedings of the 23rd Workshop on Biomedical Natural Language Processing
Large Language Models (LLMs) offer an appealing alternative to training dedicated models for many Natural Language Processing (NLP) tasks. However, outdated knowledge and hallucination issues can be major obstacles in their application in knowledge-intensive biomedical scenarios. In this study, we consider the task of biomedical concept recognition (CR) from unstructured scientific literature and explore the use of Retrieval Augmented Generation (RAG) to improve accuracy and reliability of the LLM-based biomedical CR. Our approach, named REAL (Retrieval Augmented Entity Linking), combines the generative capabilities of LLMs with curated knowledge bases to automatically annotate natural language texts with concepts from bio-ontologies. By applying REAL to benchmark corpora on phenotype concept recognition, we show its effectiveness in improving LLM-based CR performance. This research highlights the potential of combining LLMs with external knowledge sources to advance biomedical text processing.
2022
What is Done is Done: an Incremental Approach to Semantic Shift Detection
Francesco Periti
|
Alfio Ferrara
|
Stefano Montanelli
|
Martin Ruskov
Proceedings of the 3rd Workshop on Computational Approaches to Historical Language Change
Contextual word embedding techniques for semantic shift detection are receiving more and more attention. In this paper, we present What is Done is Done (WiDiD), an incremental approach to semantic shift detection based on incremental clustering techniques and contextual embedding methods to capture the changes over the meanings of a target word along a diachronic corpus. In WiDiD, the word contexts observed in the past are consolidated as a set of clusters that constitute the “memory” of the word meanings observed so far. Such a memory is exploited as a basis for subsequent word observations, so that the meanings observed in the present are stratified over the past ones.
2017
Unsupervised Detection of Argumentative Units though Topic Modeling Techniques
Alfio Ferrara
|
Stefano Montanelli
|
Georgios Petasis
Proceedings of the 4th Workshop on Argument Mining
In this paper we present a new unsupervised approach, “Attraction to Topics” – A2T , for the detection of argumentative units, a sub-task of argument mining. Motivated by the importance of topic identification in manual annotation, we examine whether topic modeling can be used for performing unsupervised detection of argumentative sentences, and to what extend topic modeling can be used to classify sentences as claims and premises. Preliminary evaluation results suggest that topic information can be successfully used for the detection of argumentative sentences, at least for corpora used for evaluation. Our approach has been evaluated on two English corpora, the first of which contains 90 persuasive essays, while the second is a collection of 340 documents from user generated content.
Search
Co-authors
- Alfio Ferrara 2
- Georgios Petasis 1
- Darya Shlyk 1
- Tudor Groza 1
- Marco Mesiti 1
- show all...