汉语学习者依存句法树库构建(Construction of a Treebank of Learner Chinese)

Jialu Shi (师佳璐), Xinyu Luo (罗昕宇), Liner Yang (杨麟儿), Dan Xiao (肖丹), Zhengsheng Hu (胡正声), Yijun Wang (王一君), Jiaxin Yuan (袁佳欣), Yu Jingsi (余婧思), Erhong Yang (杨尔弘)


Abstract
汉语学习者依存句法树库为非母语者语料提供依存句法分析,可以支持第二语言教学与研究,也对面向第二语言的句法分析、语法改错等相关研究具有重要意义。然而,现有的汉语学习者依存句法树库数量较少,且在标注方面仍存在一些问题。为此,本文改进依存句法标注规范,搭建在线标注平台,并开展汉语学习者依存句法标注。本文重点介绍了数据选取、标注流程等问题,并对标注结果进行质量分析,探索二语偏误对标注质量与句法分析的影响。
Anthology ID:
2020.ccl-1.54
Volume:
Proceedings of the 19th Chinese National Conference on Computational Linguistics
Month:
October
Year:
2020
Address:
Haikou, China
Venue:
CCL
SIG:
Publisher:
Chinese Information Processing Society of China
Note:
Pages:
581–592
Language:
Chinese
URL:
https://aclanthology.org/2020.ccl-1.54
DOI:
Bibkey:
Cite (ACL):
Jialu Shi, Xinyu Luo, Liner Yang, Dan Xiao, Zhengsheng Hu, Yijun Wang, Jiaxin Yuan, Yu Jingsi, and Erhong Yang. 2020. 汉语学习者依存句法树库构建(Construction of a Treebank of Learner Chinese). In Proceedings of the 19th Chinese National Conference on Computational Linguistics, pages 581–592, Haikou, China. Chinese Information Processing Society of China.
Cite (Informal):
汉语学习者依存句法树库构建(Construction of a Treebank of Learner Chinese) (Shi et al., CCL 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2020.ccl-1.54.pdf
Data
Universal Dependencies