Abstract
In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents mainly originating from Greek or Latin. The interpretation of these compounds are keystones to access information. In this paper, we present morphological resources aiming at coping with these biomedical morphological compounds. Following previous work (Claveau et al. 2011,Claveau et al. 12), these resources are automatically built using Japanese terms in Kanjis as a pivot language and alignment techniques. We show how these alignment information can be used for segmenting compounds, attaching semantic interpretation to each part, proposing definitions (gloses) of the compounds... When possible, these tasks are compared with state-of-the-art tools, and the results show the interest of our automatically built probabilistic resources.- Anthology ID:
- L14-1004
- Volume:
- Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
- Month:
- May
- Year:
- 2014
- Address:
- Reykjavik, Iceland
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 3348–3354
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/1003_Paper.pdf
- DOI:
- Cite (ACL):
- Vincent Claveau and Ewa Kijak. 2014. Generating and using probabilistic morphological resources for the biomedical domain. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 3348–3354, Reykjavik, Iceland. European Language Resources Association (ELRA).
- Cite (Informal):
- Generating and using probabilistic morphological resources for the biomedical domain (Claveau & Kijak, LREC 2014)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/1003_Paper.pdf