Empty Argument Insertion in the Hindi PropBank

Ashwini Vaidya, Jinho D. Choi, Martha Palmer, Bhuvana Narasimhan


Abstract
This paper examines both linguistic behavior and practical implication of empty argument insertion in the Hindi PropBank. The Hindi PropBank is annotated on the Hindi Dependency Treebank, which contains some empty categories but not the empty arguments of verbs. In this paper, we analyze four kinds of empty arguments, *PRO*, *REL*, *GAP*, *pro*, and suggest effective ways of annotating these arguments. Empty arguments such as *PRO* and *REL* can be inserted deterministically; we present linguistically motivated rules that automatically insert these arguments with high accuracy. On the other hand, it is difficult to find deterministic rules to insert *GAP* and *pro*; for these arguments, we introduce a new annotation scheme that concurrently handles both semantic role labeling and empty category insertion, producing fast and high quality annotation. In addition, we present algorithms for finding antecedents of *REL* and *PRO*, and discuss why finding antecedents for some types of *PRO* is difficult.
Anthology ID:
L12-1229
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1522–1526
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/442_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Ashwini Vaidya, Jinho D. Choi, Martha Palmer, and Bhuvana Narasimhan. 2012. Empty Argument Insertion in the Hindi PropBank. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1522–1526, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Empty Argument Insertion in the Hindi PropBank (Vaidya et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/442_Paper.pdf