Persian Proposition Bank

Azadeh Mirzaei, Amirsaeid Moloodi


Abstract
This paper describes the semantic role labeling procedure and the development of the first manually annotated Persian Proposition Bank (PerPB), which adds a layer of predicate-argument information to the syntactic structures of the Persian Dependency Treebank (PerDT). During annotation, the annotators could see the syntactic information of every sentence; in total they annotated 29,982 sentences containing more than 9,200 unique verbs. The direct syntactic dependents of the verbs were the first candidates for annotation. Indirect dependents were not annotated unless their phrasal heads were propositional and had their own arguments or adjuncts. Hence, in addition to the semantic role labeling of verbs, the argument structures of 1,300 unique propositional nouns and 300 unique propositional adjectives were also annotated. The accuracy of the annotation process was measured by double annotation of the data at two separate stages, and the final data was prepared in the CoNLL dependency format.
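The candidate-selection rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' tooling: the `Token` class, the `propositional` flag, the coarse tag names (`V`, `N`, `ADJ`, `PREP`, `PRON`) and the English toy sentence are all assumptions made for illustration. The sketch treats verbs and propositional nouns or adjectives as predicates and lists only their direct syntactic dependents as annotation candidates.

```python
from dataclasses import dataclass


@dataclass
class Token:
    idx: int                     # 1-based position in the sentence
    form: str                    # surface word form
    pos: str                     # coarse POS tag (hypothetical tagset)
    head: int                    # index of the syntactic head, 0 for the root
    propositional: bool = False  # hypothetical flag: noun/adjective with its own arguments


def is_predicate(tok):
    """Verbs are always predicates; nouns and adjectives only when propositional."""
    return tok.pos == "V" or (tok.pos in {"N", "ADJ"} and tok.propositional)


def annotation_candidates(sentence):
    """Map each predicate index to the indices of its direct dependents."""
    candidates = {}
    for tok in sentence:
        if is_predicate(tok):
            candidates[tok.idx] = [d.idx for d in sentence if d.head == tok.idx]
    return candidates


# Toy dependency tree for "She made decisions about trips" (illustrative only).
sentence = [
    Token(1, "She", "PRON", 2),
    Token(2, "made", "V", 0),
    Token(3, "decisions", "N", 2, propositional=True),
    Token(4, "about", "PREP", 3),
    Token(5, "trips", "N", 4),
]

result = annotation_candidates(sentence)
print(result)  # {2: [1, 3], 3: [4]}
```

In this toy tree, "trips" is only an indirect dependent of the verb and is not a direct dependent of any predicate, so it is not proposed as a candidate. "about" is a candidate only because its head, "decisions", is marked as propositional.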
Anthology ID:
L16-1606
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
3828–3835
URL:
https://aclanthology.org/L16-1606
Cite (ACL):
Azadeh Mirzaei and Amirsaeid Moloodi. 2016. Persian Proposition Bank. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3828–3835, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Persian Proposition Bank (Mirzaei & Moloodi, LREC 2016)
PDF:
https://preview.aclanthology.org/update-css-js/L16-1606.pdf