From medical language processing to BioNLP domain

Gabriella Pardelli, Manuela Sassi, Sara Goggi, Stefania Biagioni


Abstract
This paper presents the results of terminological work on a reference corpus in the domain of Biomedicine. In particular, the research analyses the use of certain biomedical terms in order to verify how they change over time, with the aim of retrieving from the net the very essence of documentation. The terminological sample contains words used in BioNLP and Biomedicine and identifies which terms are passing from scientific publications to the daily press and which remain reserved to scientific production. The final aim of this work is to determine how scientific dissemination to an ever larger part of society enables ordinary citizens to approach communication on biomedical research and development. Its main source is a reference corpus made up of three main repositories from which information related to BioNLP and Biomedicine is extracted. The paper is divided into three sections: 1) an introduction dedicated to data extracted from scientific documentation; 2) a second section devoted to methodology and data description; 3) a third section containing a statistical representation of terms extracted from the archive. Indexes and concordances make it possible to reflect on the use of certain terms in this field and offer possible keys for accessing the extraction of knowledge in the digital era.
Anthology ID:
L12-1402
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
2049–2055
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/687_Paper.pdf
Cite (ACL):
Gabriella Pardelli, Manuela Sassi, Sara Goggi, and Stefania Biagioni. 2012. From medical language processing to BioNLP domain. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 2049–2055, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
From medical language processing to BioNLP domain (Pardelli et al., LREC 2012)