LexFr: Adapting the LexIt Framework to Build a Corpus-based French Subcategorization Lexicon
Giulia Rambelli, Gianluca Lebani, Laurent Prévot, Alessandro Lenci
Abstract
This paper introduces LexFr, a corpus-based French lexical resource built by adapting the framework LexIt, originally developed to describe the combinatorial potential of Italian predicates. As in the original framework, the behavior of a group of target predicates is characterized by a series of syntactic (i.e., subcategorization frames) and semantic (i.e., selectional preferences) statistical information (a.k.a. distributional profiles) whose extraction process is mostly unsupervised. The first release of LexFr includes information for 2,493 verbs, 7,939 nouns and 2,628 adjectives. In these pages we describe the adaptation process and evaluated the final resource by comparing the information collected for 20 test verbs against the information available in a gold standard dictionary. In the best performing setting, we obtained 0.74 precision, 0.66 recall and 0.70 F-measure.- Anthology ID:
- L16-1148
- Volume:
- Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month:
- May
- Year:
- 2016
- Address:
- Portorož, Slovenia
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 930–937
- Language:
- URL:
- https://aclanthology.org/L16-1148
- DOI:
- Cite (ACL):
- Giulia Rambelli, Gianluca Lebani, Laurent Prévot, and Alessandro Lenci. 2016. LexFr: Adapting the LexIt Framework to Build a Corpus-based French Subcategorization Lexicon. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 930–937, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal):
- LexFr: Adapting the LexIt Framework to Build a Corpus-based French Subcategorization Lexicon (Rambelli et al., LREC 2016)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/L16-1148.pdf