Filled pauses as cues to the complexity of upcoming phrases for native and non-native listeners M Watanabe, K Hirose, Y Den, N Minematsu Speech communication 50 (2), 81-94, 2008 | 167 | 2008 |
Free software toolkit for Japanese large vocabulary continuous speech recognition T Kawahara, A Lee, T Kobayashi, K Takeda, N Minematsu, S Sagayama, ... | 154 | 2000 |
WFST-based grapheme-to-phoneme conversion: Open source tools for alignment, model-building and decoding JR Novak, N Minematsu, K Hirose Proceedings of the 10th International Workshop on Finite State Methods and …, 2012 | 138 | 2012 |
Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons Y Qiao, N Shimomura, N Minematsu 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 107 | 2008 |
Phonetisaurus: Exploring grapheme-to-phoneme conversion with joint n-gram models in the WFST framework JR Novak, N Minematsu, K Hirose Natural Language Engineering 22 (6), 907-938, 2016 | 104 | 2016 |
A Study on Invariance of -Divergence and Its Application to Speech Recognition Y Qiao, N Minematsu IEEE Transactions on Signal Processing 58 (7), 3884-3890, 2010 | 96 | 2010 |
Mathematical evidence of the acoustic universal structure in speech N Minematsu Proceedings.(ICASSP'05). IEEE International Conference on Acoustics, Speech …, 2005 | 95 | 2005 |
Automatic estimation of one's age with his/her speech based upon acoustic modeling techniques of speakers N Minematsu, M Sekiguchi, K Hirose 2002 IEEE International Conference on Acoustics, Speech, and Signal …, 2002 | 95 | 2002 |
One-to-many voice conversion based on tensor representation of speaker space D Saito, K Yamamoto, N Minematsu, K Hirose Twelfth Annual Conference of the International Speech Communication Association, 2011 | 91 | 2011 |
A method for automatic extraction of model parameters from fundamental frequency contours of speech S Narusawa, N Minematsu, K Hirose, H Fujisaki 2002 IEEE International conference on acoustics, speech, and signal …, 2002 | 90 | 2002 |
Development of English speech database read by Japanese to support CALL research N Minematsu, Y Tomiyama, K Yoshimoto, K Shimizu, S Nakagawa, ... Proc. ICA 1 (2004), 557-560, 2004 | 84 | 2004 |
Sharable software repository for Japanese large vocabulary continuous speech recognition T Kawahara, T Kobayashi, K Takeda, N Minematsu, K Itou, M Yamamoto, ... | 73 | 1998 |
Wasserstein GAN and waveform loss-based acoustic model training for multi-speaker text-to-speech synthesis systems using a WaveNet vocoder Y Zhao, S Takaki, HT Luong, J Yamagishi, D Saito, N Minematsu IEEE access 6, 60478-60488, 2018 | 59 | 2018 |
Japanese dictation toolkit-1997 version T Kawahara, A Lee, T Kobayashi, K Takeda, N Minematsu, K Itou, A Ito, ... Journal of the Acoustical Society of Japan (E) 20 (3), 233-239, 1999 | 56 | 1999 |
Yet another acoustic representation of speech sounds N Minematsu 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 53 | 2004 |
Synthesis of F0 contours using generation process model parameters predicted from unlabeled corpora: Application to emotional speech synthesis K Hirose, K Sato, Y Asano, N Minematsu Speech communication 46 (3-4), 385-404, 2005 | 52 | 2005 |
English Speech Database Read by Japanese Learners for CALL System Development. N Minematsu, Y Tomiyama, K Yoshimoto, K Shimizu, S Nakagawa, ... LREC, 2002 | 51 | 2002 |
Improving WFST-based G2P conversion with alignment constraints and RNNLM N-best rescoring JR Novak, PR Dixon, N Minematsu, K Hirose, C Hori, H Kashioka Thirteenth Annual Conference of the International Speech Communication …, 2012 | 46 | 2012 |
Structural representation of the pronunciation and its use for CALL N Minematsu, S Asakawa, K Hirose 2006 IEEE Spoken Language Technology Workshop, 126-129, 2006 | 45 | 2006 |
Multi-stream parameterization for structural speech recognition S Asakawa, N Minematsu, K Hirose 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 44 | 2008 |