Jinsong Zhang
2022
基于GPT-2和互信息的语言单位信息量对韵律特征的影响(Prosodic Effects of Speech Unit’s Information Based on GPT-2 and Mutual Information)
Yun Hao (郝韵) | Yanlu Xie (解焱陆) | Binghuai Lin (林炳怀) | Jinsong Zhang (张劲松)
Proceedings of the 21st Chinese National Conference on Computational Linguistics
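“Information-theoretic studies of speech production have found that the more information a linguistic unit carries, the more likely its speech signal is to be strengthened. Existing work mainly measures the information content of linguistic units by self-information, but this approach struggles to model long-distance context. This study introduces measures of information content based on the pre-trained language model GPT-2 and on text-pinyin mutual information, and examines how the information content of Mandarin words, finals, and tones affects the prosodic features of produced speech. The results show that when Mandarin words and finals carry more information, their prosodic features tend to be enhanced, which supports the effectiveness of the proposed method. The information effect is more pronounced for duration than for pitch and intensity.”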
2014
Phoneme Set Design Using English Speech Database by Japanese for Dialogue-Based English CALL Systems
Xiaoyun Wang | Jinsong Zhang | Masafumi Nishida | Seiichi Yamamoto
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
This paper describes a method of generating a reduced phoneme set for dialogue-based computer assisted language learning (CALL) systems. We designed a reduced phoneme set consisting of classified phonemes more aligned with the learners' speech characteristics than the canonical set of a target language. This reduced phoneme set provides an inherently more appropriate model for dealing with mispronunciation by second language speakers. In this study, we used a phonetic decision tree (PDT)-based top-down sequential splitting method to generate the reduced phoneme set and then applied this method to a translation-game type English CALL system for Japanese to determine its effectiveness. Experimental results showed that the proposed method improves the performance of recognizing non-native speech.
Co-authors
- Yun Hao (郝韵) 1
- Yanlu Xie (解焱陆) 1
- Binghuai Lin 1
- Xiaoyun Wang 1
- Masafumi Nishida 1