一种基于相似度的藏文词同现网络构建及特征分析(A Research on Construction and Feature Analysis of Similarity-based Tibetan Word Co-occurrence Networks)
Dongzhou Jiayang (加羊东周), Zhijie Cai (才智杰), Zhuoma Cairang (才让卓玛), Maocuo San (三毛措)
Abstract
语言文字是人类智慧和文明的结晶,是经过漫长演化形成的复杂系统。语言同现网络采 用复杂网络技术研究语言的特征,揭示语言文字的内部结构关系。文章分析相似性同 现网络构建模块结构,提出一种基于相似度的藏文词同现网络构建方法,该方法以词 为网络节点,以相似词间连边构造词同现网络。基于相似度藏文词同现网络构建方法, 在大、中、小三类文档上建立了词同现网络,并分析了它们的统计特征,实验数据表明 建立的藏文词同现网络都具有小世界效应和无标度特征。- Anthology ID:
- 2020.ccl-1.47
- Volume:
- Proceedings of the 19th Chinese National Conference on Computational Linguistics
- Month:
- October
- Year:
- 2020
- Address:
- Haikou, China
- Editors:
- Maosong Sun (孙茂松), Sujian Li (李素建), Yue Zhang (张岳), Yang Liu (刘洋)
- Venue:
- CCL
- SIG:
- Publisher:
- Chinese Information Processing Society of China
- Note:
- Pages:
- 509–517
- Language:
- Chinese
- URL:
- https://aclanthology.org/2020.ccl-1.47
- DOI:
- Cite (ACL):
- Dongzhou Jiayang, Zhijie Cai, Zhuoma Cairang, and Maocuo San. 2020. 一种基于相似度的藏文词同现网络构建及特征分析(A Research on Construction and Feature Analysis of Similarity-based Tibetan Word Co-occurrence Networks). In Proceedings of the 19th Chinese National Conference on Computational Linguistics, pages 509–517, Haikou, China. Chinese Information Processing Society of China.
- Cite (Informal):
- 一种基于相似度的藏文词同现网络构建及特征分析(A Research on Construction and Feature Analysis of Similarity-based Tibetan Word Co-occurrence Networks) (Jiayang et al., CCL 2020)
- PDF:
- https://preview.aclanthology.org/naacl-24-ws-corrections/2020.ccl-1.47.pdf