Unsupervised Cross-Lingual Speaker Adaptation for HMM-Based Speech Synthes

Oura Keiichiro; 大浦 圭一郎; オオウラ ケイイチロウ; Wu Yi-Jian; Wu Yi-Jian; Yamagishi Junichi; King Simon; Wester Mirjam

Title	en Unsupervised Cross-Lingual Speaker Adaptation for HMM-Based Speech Synthes
Creator	en Oura, Keiichiro ja 大浦, 圭一郎 ja-Kana オオウラ, ケイイチロウ en Wu, Yi-Jian ja Wu, Yi-Jian en Yamagishi, Junichi en King, Simon en Wester, Mirjam
Description	Other en In the EMIME project, we are developing a mobile device that performs personalized speech-to-speech translation such that a user's spoken input in one language is used to produce spoken output in another language, while continuing to sound like the user's voice. We integrate two techniques, unsupervised adaptation for HMM-based TTS using a word-based large-vocabulary continuous speech recognizer and cross-lingual speaker adaptation for HMM-based TTS, into a single architecture. Thus, an unsupervised cross-lingual speaker adaptation system can be developed. Listening tests show very promising results, demonstrating that adapted voices sound similar to the target speaker and that differences between supervised and unsupervised cross-lingual speaker adaptation are small. Other en 14-19 March 2010Dallas, TX, USA
Publisher	en Institute of Electrical and Electronics Engineers
Date	Issued2010
Language	eng
Resource Type	conference paper
Version Type	VoR
Identifier	URI https://nitech.repo.nii.ac.jp/records/3415
Journal	en ICASSP 2010. IEEE International Conference on Acoustics, Speech and Signal Processing, 2010. Page Start4594 Page End4597
File	本文_fulltext 142.3 kB (application/pdf) Available2017-01-17
Oaidate	2025-03-14