A subtree-based factorization of dependency parsing

Qiuye Zhao, Qun Liu


Abstract
We propose a dependency parsing pipeline in which the parsing of long-distance projections and of localized dependencies is explicitly decomposed at the input level. In the first stage, coarse constituent parsing outputs are transformed into ‘carved’ sequences; in the second stage, a chosen baseline dependency parsing model operates only on these carved sequences. When k-best constituent parsing outputs are kept, a third stage is required to search for an optimal combination of the overlapping dependency subtrees. In this sense, our dependency model is subtree-factored. We explore alternative approaches for scoring subtrees, including feature-based models as well as continuous representations. The search for the optimal subset of subtrees to combine is formulated as an integer linear programming (ILP) problem. This framework especially benefits models that perform poorly on long sentences, generally improving baselines by 0.75-1.28 UAS on English and achieving performance comparable to high-order models while running faster. For Chinese, the largest gain is 3.63 UAS, obtained when the proposed framework is applied to first-order parsing models.
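To make the third-stage search more concrete, the following is a minimal sketch of a subset-selection problem of the kind the abstract describes. It is not the paper's formulation: the abstract does not state the ILP's objective or constraints, so the sketch assumes that each candidate subtree covers a set of token positions, that each candidate has a single scalar score, and that a valid combination covers every token exactly once. The function name `select_subtrees` and the toy candidates are hypothetical, and exhaustive enumeration stands in for an ILP solver.

```python
from itertools import combinations


def select_subtrees(n_tokens, candidates):
    """Pick the highest-scoring set of candidates covering each token exactly once.

    candidates: list of (span, score) pairs, where span is a frozenset of
    token indices (ASSUMPTION: a subtree is summarized by the tokens it covers).

    This is the 0-1 program
        maximize   sum_i score_i * x_i
        subject to sum_{i : t in span_i} x_i = 1   for every token t
    solved by brute force, which is only practical for toy inputs.
    """
    all_tokens = set(range(n_tokens))
    best_choice, best_score = None, float("-inf")
    for size in range(1, len(candidates) + 1):
        for choice in combinations(range(len(candidates)), size):
            covered = [t for i in choice for t in candidates[i][0]]
            # Exactly-once coverage: no token repeated, no token missing.
            if len(covered) != n_tokens or set(covered) != all_tokens:
                continue
            score = sum(candidates[i][1] for i in choice)
            if score > best_score:
                best_choice, best_score = choice, score
    return best_choice, best_score


# Hypothetical overlapping candidates for a 5-token sentence.
toy_candidates = [
    (frozenset({0, 1}), 1.0),
    (frozenset({2, 3, 4}), 2.0),
    (frozenset({0, 1, 2}), 2.5),
    (frozenset({3, 4}), 1.0),
    (frozenset({1, 2}), 3.0),  # high score, but no valid cover includes it
]

if __name__ == "__main__":
    print(select_subtrees(5, toy_candidates))
```

In this toy instance the highest-scoring single candidate cannot be part of any valid cover, which illustrates why a joint search over subsets, rather than greedy selection, may be needed under these assumptions.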
Anthology ID:
C16-1057
Volume:
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Yuji Matsumoto, Rashmi Prasad
Venue:
COLING
Publisher:
The COLING 2016 Organizing Committee
Pages:
589–598
URL:
https://aclanthology.org/C16-1057
Cite (ACL):
Qiuye Zhao and Qun Liu. 2016. A subtree-based factorization of dependency parsing. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 589–598, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
A subtree-based factorization of dependency parsing (Zhao & Liu, COLING 2016)
PDF:
https://preview.aclanthology.org/nschneid-patch-1/C16-1057.pdf