Probabilistic Prototype Model for Serendipitous Property Mining

Taesung Lee, Seung-won Hwang, Zhongyuan Wang


Abstract
Besides providing the relevant information, amusing users has been an important role of the web. Many web sites provide serendipitous (unexpected but relevant) information to draw user traffic. In this paper, we study the representative scenario of mining an amusing quiz. An existing approach leverages a knowledge base to mine an unexpected property then find quiz questions on such property, based on prototype theory in cognitive science. However, existing deterministic model is vulnerable to noise in the knowledge base. Therefore, we instead propose to leverage probabilistic approach to build a prototype that can overcome noise. Our extensive empirical study shows that our approach not only significantly outperforms baselines by 0.06 in accuracy, and 0.11 in serendipity but also shows higher relevance than the traditional relevance-pursuing baseline using TF-IDF.
Anthology ID:
C16-1064
Volume:
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Yuji Matsumoto, Rashmi Prasad
Venue:
COLING
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
663–673
Language:
URL:
https://aclanthology.org/C16-1064
DOI:
Bibkey:
Cite (ACL):
Taesung Lee, Seung-won Hwang, and Zhongyuan Wang. 2016. Probabilistic Prototype Model for Serendipitous Property Mining. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 663–673, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Probabilistic Prototype Model for Serendipitous Property Mining (Lee et al., COLING 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/naacl24-info/C16-1064.pdf