Using Collaborative Filtering to Model Argument Selection

Sagar Indurkhya


Abstract
This study evaluates whether model-based Collaborative Filtering (CF) algorithms, which have been extensively studied and widely used to build recommender systems, can be used to predict which common nouns a predicate can take as its complement. We find that, when trained on verb-noun co-occurrence data drawn from the Corpus of Contemporary American English (COCA), two popular model-based CF algorithms, Singular Value Decomposition and Non-negative Matrix Factorization, perform well on this task, each achieving an AUROC of at least 0.89 and surpassing several different baselines. We then show that the embedding vectors for verbs and nouns learned by the two CF models can be quantized (via k-means clustering) with minimal loss of performance on the prediction task, using only a small number of verb and noun clusters relative to the number of distinct verbs and nouns. Finally, we evaluate the alignment between the quantized embedding vectors for verbs and the Levin verb classes, finding that the alignment surpasses several randomized baselines. We conclude by discussing how model-based CF algorithms might be applied to learning restrictions on constituent selection between various lexical categories and how these (learned) models could then be used to augment a (rule-based) constituency grammar.
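To make the setup in the abstract concrete, the following is a minimal sketch of Non-negative Matrix Factorization applied to a verb-noun co-occurrence matrix. It is not the paper's implementation: the verbs, nouns, and counts are invented toy values (not drawn from COCA), the rank, iteration count, and seed are arbitrary, and the helper names `matmul`, `transpose`, `frob_error`, and `nmf` are hypothetical. It uses the standard multiplicative update rules for the squared-error objective and assumes nothing beyond the Python standard library.

```python
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]

def frob_error(v, w, h):
    """Squared Frobenius distance between v and the product w @ h."""
    approx = matmul(w, h)
    return sum((x - y) ** 2 for rv, ra in zip(v, approx) for x, y in zip(rv, ra))

def nmf(v, rank, iters=200, seed=0, eps=1e-9):
    """Factor a non-negative matrix v (verbs x nouns) as w @ h.

    Uses multiplicative updates, which keep every entry of w and h
    non-negative. Rows of w act as verb embeddings and columns of h
    as noun embeddings.
    """
    rng = random.Random(seed)
    n_rows, n_cols = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(n_rows)]
    h = [[rng.random() + 0.1 for _ in range(n_cols)] for _ in range(rank)]
    for _ in range(iters):
        # Update h: h <- h * (w^T v) / (w^T w h)
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(n_cols)]
             for i in range(rank)]
        # Update w: w <- w * (v h^T) / (w h h^T)
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(rank)]
             for i in range(n_rows)]
    return w, h

# Toy co-occurrence counts (invented for illustration only).
verbs = ["eat", "drink", "read", "write"]
nouns = ["apple", "bread", "water", "tea", "book", "letter"]
counts = [
    [5, 4, 0, 0, 0, 0],  # eat
    [0, 0, 6, 3, 0, 0],  # drink
    [0, 0, 0, 0, 7, 2],  # read
    [0, 0, 0, 0, 1, 5],  # write
]

w, h = nmf(counts, rank=3)
# scores[i][j] is the model's affinity between verbs[i] and nouns[j];
# higher values suggest a more plausible verb-complement pairing.
scores = matmul(w, h)
```

In a setting like the one the abstract describes, scores of this kind would be compared against held-out verb-noun pairs to compute an AUROC, and the rows of `w` (or columns of `h`) could be passed to a clustering routine such as k-means to obtain quantized embeddings.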
Anthology ID:
2021.ranlp-1.71
Volume:
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021)
Month:
September
Year:
2021
Address:
Held Online
Venue:
RANLP
Publisher:
INCOMA Ltd.
Pages:
629–639
URL:
https://aclanthology.org/2021.ranlp-1.71
Cite (ACL):
Sagar Indurkhya. 2021. Using Collaborative Filtering to Model Argument Selection. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pages 629–639, Held Online. INCOMA Ltd.
Cite (Informal):
Using Collaborative Filtering to Model Argument Selection (Indurkhya, RANLP 2021)
PDF:
https://preview.aclanthology.org/update-css-js/2021.ranlp-1.71.pdf