Yoshinori Sagisaka


2016

Comparison of Grapheme-to-Phoneme Conversion Methods on a Myanmar Pronunciation Dictionary
Ye Kyaw Thu | Win Pa Pa | Yoshinori Sagisaka | Naoto Iwahashi
Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing (WSSANLP2016)

Grapheme-to-Phoneme (G2P) conversion is the task of predicting the pronunciation of a word given its graphemic or written form. It is an important part of both automatic speech recognition (ASR) and text-to-speech (TTS) systems. In this paper, we evaluate seven G2P conversion approaches on a manually tagged Myanmar phoneme dictionary: Adaptive Regularization of Weight Vectors (AROW) based structured learning (S-AROW), Conditional Random Field (CRF), Joint-sequence models (JSM), phrase-based statistical machine translation (PBSMT), Recurrent Neural Network (RNN), Support Vector Machine (SVM) based point-wise classification, and Weighted Finite-state Transducers (WFST). The G2P bootstrapping experimental results were measured with both automatic phoneme error rate (PER) calculation and manual checking in terms of voiced/unvoiced, tone, consonant, and vowel errors. The results show that the CRF, PBSMT, and WFST approaches are the best-performing methods for G2P conversion on the Myanmar language.

2014

Integrating Dictionaries into an Unsupervised Model for Myanmar Word Segmentation
Ye Kyaw Thu | Andrew Finch | Eiichiro Sumita | Yoshinori Sagisaka
Proceedings of the Fifth Workshop on South and Southeast Asian Natural Language Processing

2013

Density Maximization in Context-Sense Metric Space for All-words WSD
Koichi Tanigaki | Mitsuteru Shiba | Tatsuji Munaka | Yoshinori Sagisaka
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

2012

Trans-disciplinary spoken language processing studies for scientific understanding of second language learner’s characteristics
Yoshinori Sagisaka
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, volume 4: Invited Conferences

2001

Multi-Class Composite N-gram Language Model for Spoken Language Processing Using Multiple Word Clusters
Hirofumi Yamamoto | Shuntaro Isogai | Yoshinori Sagisaka
Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics

1998

Learning a Syntagmatic and Paradigmatic Structure from Language Data with a Bi-Multigram Model
Sabine Deligne | Yoshinori Sagisaka
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 1

Learning a syntagmatic and paradigmatic structure from language data with a bi-multigram model
Sabine Deligne | Yoshinori Sagisaka
COLING 1998 Volume 1: The 17th International Conference on Computational Linguistics