2024
汉语中介语词同现网络研究 (A Study on Chinese Interlanguage Co-occurrence Networks)
Long Qian (钱隆) | Huizhou Zhao (赵慧周) | Qian Ding (丁芊) | Zhimin Wang (王治敏)
Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 1: Main Conference)
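In recent years, applying complex-network methods to linguistic research has become a new avenue in digital humanities. Based on 214 written compositions by Japanese learners of Chinese, this paper constructs six Chinese interlanguage word co-occurrence networks at different proficiency levels and examines their structural properties and dynamic evolution. The results show that all of these networks exhibit properties characteristic of complex networks: small-world and scale-free properties, disassortativity, and hierarchical structure. These properties reveal specific patterns in learners' vocabulary use. Lower-level learners tend to connect low-frequency words with high-frequency words, which may be related to an acquisition pattern that reduces cognitive load. As learners' proficiency improves, the interlanguage network parameters gradually approach those of native speakers but do not reach native-speaker levels. The study also observes that language errors affect the structure of interlanguage networks and cause variation in network structure.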
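As a rough illustration of what a word co-occurrence network and two of the measures named in the abstract look like, here is a minimal sketch. It assumes pre-segmented sentences, links only adjacent words, and treats the network as undirected and unweighted; the abstract does not state the paper's actual construction choices, and the toy sentences and all function names are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence_network(sentences):
    """Link each pair of adjacent tokens in a sentence (undirected, unweighted)."""
    adj = defaultdict(set)
    for tokens in sentences:
        for a, b in zip(tokens, tokens[1:]):
            if a != b:  # skip self-loops from repeated words
                adj[a].add(b)
                adj[b].add(a)
    return adj


def average_clustering(adj):
    """Mean local clustering coefficient; nodes with degree < 2 contribute 0."""
    total = 0.0
    for nbrs in adj.values():
        k = len(nbrs)
        if k < 2:
            continue
        links = sum(1 for u, v in combinations(nbrs, 2) if v in adj[u])
        total += 2.0 * links / (k * (k - 1))
    return total / len(adj) if adj else 0.0


def degree_assortativity(adj):
    """Pearson correlation of the degrees at the two ends of each edge.

    Negative values indicate disassortative mixing (hubs link to low-degree nodes).
    """
    xs, ys = [], []
    for u, nbrs in adj.items():
        for v in nbrs:  # every edge is visited once in each direction
            xs.append(len(adj[u]))
            ys.append(len(adj[v]))
    n = len(xs)
    if n == 0:
        return 0.0
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / (var_x * var_y) ** 0.5


# Hypothetical, already-segmented learner sentences (not from the paper's corpus).
sentences = [
    ["我", "喜欢", "学习", "汉语"],
    ["我", "喜欢", "吃", "苹果"],
    ["他", "学习", "汉语"],
]
net = build_cooccurrence_network(sentences)
clustering = average_clustering(net)
assortativity = degree_assortativity(net)
```

On real data, a small-world check would typically compare the clustering coefficient and average path length against a random graph of the same size, which this sketch leaves out.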
2020
基于词语聚类的汉语口语教材自动推送素材研究(Study on Automatic Push Material of Oral Chinese Textbook Based on Word Clustering)
Bingbing Yang (杨冰冰) | Huizhou Zhao (赵慧周) | Zhimin Wang (王治敏)
Proceedings of the 19th Chinese National Conference on Computational Linguistics
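The spread of COVID-19 has made online mobile teaching an inevitable trend in educational development. This paper takes as its research object the spoken-language materials suitable for automatic pushing of Chinese textbook content. Based on 10,341 items of daily-life spoken corpus data, it performs a quantitative analysis of the overall characteristics of the vocabulary, and on this basis clusters all the words using a word-vector model and the K-means algorithm. Drawing on the clustering results and an examination of the topics and scenes in the spoken corpus, it builds a topic-scene material library for spoken Chinese comprising 15 first-level topics, 102 second-level topics, and 81 communicative scenes. It also summarizes the common words for topics at each level. This work can provide resource support for material libraries used in automatic textbook customization.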
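The clustering step named in the abstract (word vectors followed by K-means) can be sketched as below. This is a plain-Python version of Lloyd's algorithm under stated assumptions: the 2-D vectors, the example words, and the choice of the first k vectors as initial centroids are all hypothetical, chosen only to keep the example small and deterministic. The abstract does not say which word-vector model, dimensionality, or K-means implementation the authors used.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, iterations=20):
    """Lloyd's algorithm with a fixed number of iterations.

    Initial centroids are simply the first k vectors (an assumption made
    for reproducibility; real systems usually use random or k-means++ init).
    """
    centroids = [list(v) for v in vectors[:k]]
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assignment step: each vector goes to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: squared_distance(v, centroids[c]))
            for v in vectors
        ]
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical words with made-up 2-D "embeddings" (illustration only).
words = ["米饭", "面条", "饺子", "地铁", "公交", "出租车"]
vectors = [
    (0.90, 0.10), (0.80, 0.20), (0.95, 0.05),  # food-like words
    (0.10, 0.90), (0.20, 0.80), (0.15, 0.95),  # transport-like words
]
labels, centroids = kmeans(vectors, k=2)
clusters = {}
for word, label in zip(words, labels):
    clusters.setdefault(label, []).append(word)
```

In the setting the abstract describes, the resulting clusters would then be inspected manually alongside the corpus to decide on topic and scene labels; the clustering alone does not name the topics.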
SEMA: Text Simplification Evaluation through Semantic Alignment
Xuan Zhang | Huizhou Zhao | KeXin Zhang | Yiyang Zhang
Proceedings of the 6th Workshop on Natural Language Processing Techniques for Educational Applications
Text simplification is an important branch of natural language processing. At present, the methods used to evaluate semantic retention in text simplification are mostly based on string matching. We propose SEMA (text Simplification Evaluation Measure through Semantic Alignment), a measure based on semantic alignment. It considers three types of semantic alignment: complete alignment, partial alignment, and hyponymy alignment. Our experiments show that SEMA's evaluation results are highly consistent with human evaluation on a simplified corpus of Chinese and English news texts.