A Large Scale Database of Strongly-related Events in Japanese

Tomohide Shibata, Shotaro Kohama, Sadao Kurohashi


Abstract
The knowledge about the relation between events is quite useful for coreference resolution, anaphora resolution, and several NLP applications such as dialogue system. This paper presents a large scale database of strongly-related events in Japanese, which has been acquired with our proposed method (Shibata and Kurohashi, 2011). In languages, where omitted arguments or zero anaphora are often utilized, such as Japanese, the coreference-based event extraction methods are hard to be applied, and so our method extracts strongly-related events in a two-phrase construct. This method first calculates the co-occurrence measure between predicate-arguments (events), and regards an event pair, whose mutual information is high, as strongly-related events. To calculate the co-occurrence measure efficiently, we adopt an association rule mining method. Then, we identify the remaining arguments by using case frames. The database contains approximately 100,000 unique events, with approximately 340,000 strongly-related event pairs, which is much larger than an existing automatically-constructed event database. We evaluated randomly-chosen 100 event pairs, and the accuracy was approximately 68%.
Anthology ID:
L14-1082
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3283–3288
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1107_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Tomohide Shibata, Shotaro Kohama, and Sadao Kurohashi. 2014. A Large Scale Database of Strongly-related Events in Japanese. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 3283–3288, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
A Large Scale Database of Strongly-related Events in Japanese (Shibata et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1107_Paper.pdf