LexFr: Adapting the LexIt Framework to Build a Corpus-based French Subcategorization Lexicon

Giulia Rambelli, Gianluca Lebani, Laurent Prévot, Alessandro Lenci


Abstract
This paper introduces LexFr, a corpus-based French lexical resource built by adapting the LexIt framework, originally developed to describe the combinatorial potential of Italian predicates. As in the original framework, the behavior of a group of target predicates is characterized by syntactic (i.e., subcategorization frames) and semantic (i.e., selectional preferences) statistical information (a.k.a. distributional profiles), extracted by a mostly unsupervised process. The first release of LexFr includes information for 2,493 verbs, 7,939 nouns and 2,628 adjectives. In these pages we describe the adaptation process and evaluate the final resource by comparing the information collected for 20 test verbs against the information available in a gold standard dictionary. In the best performing setting, we obtained 0.74 precision, 0.66 recall and 0.70 F-measure.
Anthology ID:
L16-1148
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
930–937
URL:
https://aclanthology.org/L16-1148
Cite (ACL):
Giulia Rambelli, Gianluca Lebani, Laurent Prévot, and Alessandro Lenci. 2016. LexFr: Adapting the LexIt Framework to Build a Corpus-based French Subcategorization Lexicon. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 930–937, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
LexFr: Adapting the LexIt Framework to Build a Corpus-based French Subcategorization Lexicon (Rambelli et al., LREC 2016)
PDF:
https://preview.aclanthology.org/ingestion-script-update/L16-1148.pdf