default search action
7th SSW 2010: Kyoto, Japan
- Yoshinori Sagisaka, Keiichi Tokuda:
The Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, SSW 2010, Kyoto, Japan, September 22-24, 2010. ISCA 2010
Tutorials
- Hideki Kawahara:
Exploration of the other aspect of vocoder revisited: A-Z STRAIGHT, TANDEM-STRAIGHT and morphing. 32-37 - Simon King:
Speech synthesis without the right data. 38
Concatenative Speech Synthesis
- H. Timothy Bunnell:
Crafting small databases for unit selection TTS: effects on intelligibility. 40-44 - Alistair Conkie, Ann K. Syrdal:
Composite TTS voices. 45-48 - Alexander Kain, Todd K. Leen:
Compression of line spectral frequency parameters using the asynchronous interpolation model. 49-54
Voice Conversion
- Fernando Villavicencio, Esteban Maestre:
GMM-PCA based speaker-timbre conversion on full-quality speech. 56-61 - Yi-Chin Huang, Chung-Hsien Wu, Chung-Han Lee, Yu-Ting Chao:
Voice conversion using precise speech alignment based on spectral property and eigen-codeword distribution. 62-67 - Elizabeth Godoy, Olivier Rosec, Thierry Chonavel:
On transforming spectral peaks in voice conversion. 68-73 - Chie Hayashida, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano:
Linear transformation approaches to many-to-one voice conversion. 74-79 - Takashi Nose, Takao Kobayashi:
HMM-based robust voice conversion using adaptive F0 quantization. 80-85
Statistical Parametric Speech Synthesis
- Ranniery Maia, Heiga Zen, Mark J. F. Gales:
Statistical parametric speech synthesis with joint estimation of acoustic and excitation model parameters. 88-93 - Kai Yu, Blaise Thomson, Steve J. Young:
From discontinuous to continuous F0 modelling in HMM-based speech synthesis. 94-99 - Shinji Takaki, Yoshihiko Nankaku, Keiichi Tokuda:
Spectral modeling with contextual additive structure for HMM-based speech synthesis. 100-105 - Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Bayesian speech synthesis framework integrating training and synthesis processes. 106-111
Expressive Speech Synthesis
- Ingmar Steiner, Marc Schröder, Marcela Charfuelan, Annette Klepp:
Symbolic vs. acoustics-based style control for expressive unit selection. 114-119 - Jan Romportl, Enrico Zovato, Raúl Aquino-Santos, Pavel Ircing, José Relaño-Gil, Morena Danieli:
Application of expressive TTS synthesis in an advanced ECA system. 120-125 - Chih-Yung Yang, Chia-Ping Chen:
A hidden Markov model-based approach for emotional speech synthesis. 126-129 - Fabio Tesser, Enrico Zovato, Mauro Nicolao, Piero Cosi:
Two vocoder techniques for neutral to emotional timbre conversion. 130-135
Evaluation and Applications
- Maria K. Wolters, Karl Isaac, Steve Renals:
Evaluating speech synthesis intelligibility using Amazon Mechanical Turk. 136-141 - Anna C. Janska, Robert A. J. Clark:
Further exploration of the possibilities and pitfalls of multidimensional scaling as a tool for the evaluation of the quality of synthesized speech. 142-147 - Kishore Prahallad, Alan W. Black:
Handling large audio files in audio books for building synthetic voices. 148-153 - Gopala Krishna Anumanchipalli, Prasanna Kumar Muthukumar, Udhyakumar Nallasamy, Alok Parlikar, Alan W. Black, Brian Langner:
Improving speech synthesis for noisy environments. 154-159
Prosody and Conversation
- Kishore Prahallad, E. Veera Raghavendra, Alan W. Black:
Learning speaker-specific phrase breaks for text-to-speech systems. 162-166 - Nobuyuki Nishizawa, Tsuneo Kato:
Substitution of state distributions to reproduce natural prosody on HMM-based speech synthesizers. 167-172 - J. Sebastian Andersson, Junichi Yamagishi, Robert A. J. Clark:
Utilising spontaneous conversational speech in HMM-based speech synthesis. 173-178 - Ann K. Syrdal, Alistair Conkie, Yeon-Jun Kim, Mark C. Beutnagel:
Speech acts and dialog TTS. 179-183
Multi-Lingual Speech Synthesis
- Heiga Zen, Norbert Braunschweiler, Sabine Buchholz, Kate M. Knill, Sacha Krstulovic, Javier Latorre:
HMM-based polyglot speech synthesis by speaker and language adaptive training. 186-191 - Mirjam Wester, John Dines, Matthew Gibson, Hui Liang, Yi-Jian Wu, Lakshmi Babu Saheer, Simon King, Keiichiro Oura, Philip N. Garner, William Byrne, Yong Guan, Teemu Hirsimäki, Reima Karhila, Mikko Kurimo, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Junichi Yamagishi:
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project. 192-197
Selected Topics
- Jerome R. Bellegarda:
Toward naturally expressive speech synthesis: data - driven emotion detection using latent affective analysis. 200-205 - Gopala Krishna Anumanchipalli, Ying-Chang Cheng, Joseph Fernandez, Xiaohan Huang, Qi Mao, Alan W. Black:
KLATTSTAT: knowledge-based parametric speech synthesis. 206-210 - Keiichiro Oura, Ayami Mase, Tomohiko Yamada, Satoru Muto, Yoshihiko Nankaku, Keiichi Tokuda:
Recent development of the HMM-based singing voice synthesis system - Sinsy. 211-216 - Lijuan Wang, Xiaojun Qian, Wei Han, Frank K. Soong:
Photo-real lips synthesis with trajectory-guided sample selection. 217-222
Poster Sessions
- Lakshmi Babu Saheer, John Dines, Philip N. Garner, Hui Liang:
Implementation of VTLN for statistical speech synthesis. 224-229 - Eva Lasarcyk, Charlotte Wollermann:
Do prosodic cues influence uncertainty perception in articulatory speech synthesis? 230-235 - Yong Guan, Jilei Tian, Yi-Jian Wu, Junichi Yamagishi, Jani Nurminen:
An unified and automatic approach of Mandarin HTS system. 236-239 - Sathish Pammi, Marc Schröder, Marcela Charfuelan, Oytun Türk, Ingmar Steiner:
Synthesis of listener vocalisations with imposed intonation contours. 240-245 - Jinfu Ni, Hisashi Kawai:
An investigation of the impact of speech transcript errors on HMM voices. 246-251 - Keijiro Saino, Makoto Tachibana, Hideki Kenmochi:
An HMM-based singing style modeling system for singing voice synthesizers. 252-257 - Dong-Yan Huang, Susanto Rahardja, Ee Ping Ong:
Lombard effect mimicking. 258-263 - Chen-Yu Chiang, Sin-Horng Chen, Yih-Ru Wang:
Unsupervised prosody labeling for constructing Mandarin TTS. 264-269 - Benjamin Picart, Thomas Drugman, Thierry Dutoit:
Analysis and synthesis of hypo- and hyperarticulated speech. 270-275 - Rajakrishnan Rajkumar, Michael White, Shari R. Speer, Kiwako Ito:
Evaluating prosody in synthetic speech with online (eye-tracking) and offline (rating) methods. 276-281 - Xu Shao, Vincent Pollet, Andrew P. Breen:
Refined statistical model tuning for speech synthesis. 284-287 - Didier Cadic, Christophe d'Alessandro:
High quality TTS voices within one day. 288-293 - Tatyana Polyakova, Antonio Bonafonte:
Nativization of English words in Spanish using analogy. 294-299 - Asami Yamamoto, Kazuhiro Suzuki, Kook Cho, Yoichi Yamashita:
Automatic prosodic labeling of accent information for Japanese spoken sentences. 300-305 - Mohamed Abou-Zleikha, Peter Cahill, Julie Carson-Berndsen:
An automatic pitch model with distance function. 306-311 - Minghui Dong, Ling Cen, Paul Y. Chan, Haizhou Li:
Considering readability in text-to-speech recording script design. 312-316 - Oliver Watts, Junichi Yamagishi, Simon King:
Letter-based speech synthesis. 317-322 - Christophe Veaux, Pierre Lanchantin, Xavier Rodet:
Joint prosodic and segmental unit selection for expressive speech synthesis. 323-327 - Pieter Scholtz, Justus C. Roux, Jacques P. du Toit:
Speech synthesis in the mobile user interface. 328-331 - Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, Paavo Alku:
Comparison of formant enhancement methods for HMM-based speech synthesis. 334-339 - Mumtaz B. Mustafa, Raja Noor Ainon, Roziati Zainuddin:
EM-HTS: real-time HMM-based Malay emotional speech synthesis. 340-344 - Dong-Yan Huang, Susanto Rahardja, Ee Ping Ong:
High level emotional speech morphing using STRAIGHT. 345-350 - Jean-Philippe Goldman, Sophie Roekhaut, Anne-Catherine Simon:
Adding speaking style to a TTS system. 351-354 - Donata Moers, Igor Jauk, Bernd Möbius, Petra Wagner:
Synthesizing fast speech by implementing multi-phone units in unit selection speech synthesis. 355-358 - Miaomiao Wang, Miaomiao Wen, Daisuke Saito, Keikichi Hirose, Nobuaki Minematsu:
Improved generation of prosodic features in HMM-based Mandarin speech synthesis. 359-364 - João P. Cabral, Steve Renals, Korin Richmond, Junichi Yamagishi:
An HMM-based speech synthesiser using glottal post-filtering. 365-370 - Yeon-Jun Kim, Mark C. Beutnagel:
A study of lexical stress patterns in unit selection synthesis. 371-376 - Andreas Windmann, Petra Wagner, Fabio Tamburini, Denis Arnold, Catharine Oertel:
Automatic prominence annotation of a German speech synthesis corpus: towards prominence-based prosody generation for unit selection synthesis. 377-382
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.