Speech Communication, Volume 48
Volume 48, Number 1, January 2006
- Sebastian Möller, Jan Felix Krebber, Paula M. T. Smeele:
Evaluating the speech output component of a smart-home system. 1-27
- Heungkyu Lee, Hanseok Ko:
Competing models-based text-prompted speaker independent verification algorithm. 28-44
- Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki, Kiyohiro Shikano:
An evaluation of cost functions sensitively capturing local degradation of naturalness for segment selection in concatenative speech synthesis. 45-56
- Chang Huai You, Soo Ngee Koh, Susanto Rahardja:
Masking-based beta-order MMSE speech enhancement. 57-70
- Ka-Yee Leung, Man-Wai Mak, Man-Hung Siu, Sun-Yuan Kung:
Adaptive articulatory feature-based conditional pronunciation modeling for speaker verification. 71-84
- Marie A. Roch:
Gaussian-selection-based non-optimal search for speaker identification. 85-95
- Kotta Manohar, Preeti Rao:
Speech enhancement in nonstationary noise environments using noise properties. 96-109
Volume 48, Number 2, February 2006
- Junfeng Li, Masato Akagi:
A noise reduction system based on hybrid noise estimation technique and post-filtering in arbitrary noise environments. 111-126
- Yassine Mami, Delphine Charlet:
Speaker recognition by location in the space of reference speakers. 127-141
- Vlasios Doumpiotis, William Byrne:
Lattice segmentation and minimum Bayes risk discriminative training for large vocabulary continuous speech recognition. 142-160
- Konstantin Markov, Jianwu Dang, Satoshi Nakamura:
Integration of articulatory and spectrum features based on the hybrid HMM/BN modeling framework. 161-175
- Andrea Facco, Daniele Falavigna, Roberto Gretter, Marcello Viganò:
Design and evaluation of acoustic and language models for large scale telephone services. 176-190
- Arnaud Martin, Laurent Mauuary:
Robust speech/non-speech detection based on LDA-derived parameter and voicing parameter for speech recognition in noisy environments. 191-206
- Cheng-Lung Lee, Wen-Whei Chang, Yuan-Chuan Chiang:
Spectral and prosodic transformations of hearing-impaired Mandarin speech. 207-219
- Sundarrajan Rangachari, Philipos C. Loizou:
A noise-estimation algorithm for highly non-stationary environments. 220-231
Volume 48, Numbers 3-4, March-April 2006
- Srinivas Bangalore, Dilek Hakkani-Tür, Gökhan Tür:
Introduction to the Special Issue on Spoken Language Understanding in Conversational Systems. 233-238
- Patrick Haffner:
Scaling large margin classifiers for spoken language understanding. 239-261
- Yulan He, Steve J. Young:
Spoken language understanding using the Hidden Vector State Model. 262-275
- Murat Saraclar, Brian Roark:
Utterance classification with discriminative language modeling. 276-287
- Christian Raymond, Frédéric Béchet, Renato de Mori, Géraldine Damnati:
On the use of finite state transducers for semantic interpretation. 288-304
- Chai Wutiwiwatchai, Sadaoki Furui:
A multi-stage approach for Thai spoken language understanding. 305-320
- Ruiqiang Zhang, Gen-ichiro Kikui:
Integration of speech recognition and machine translation: Speech recognition word lattice translation. 321-334
- Johan Boye, Joakim Gustafson, Mats Wirén:
Robust spoken language understanding in a computer game. 335-353
- Hilda Hardy, Alan W. Biermann, R. Bryce Inouye, Ashley McKenzie, Tomek Strzalkowski, Cristian Ursu, Nick Webb, Min Wu:
The Amities system: Data-driven techniques for automated dialogue. 354-373
- Qiang Huang, Stephen J. Cox:
Task-independent call-routing. 374-389
- Ye-Yi Wang, Alex Acero:
Rapid development of spoken language understanding grammars. 390-416
- Ryuichiro Higashinaka, Katsuhito Sudoh, Mikio Nakano:
Incorporating discourse features into confidence scoring of intention recognition results in spoken dialogue systems. 417-436
- Tong Zhang, Mark Hasegawa-Johnson, Stephen E. Levinson:
Extraction of pragmatic and semantic salience from spontaneous spoken English. 437-462
Volume 48, Number 5, May 2006
- Christopher Dromey, Shawn L. Nissen, Petrea Nohr, Samuel G. Fletcher:
Measuring tongue movements during speech: Adaptation of a magnetic jaw-tracking system. 463-473
- Marián Képesi, Luis Weruaga:
Adaptive chirp-based time-frequency analysis of speech signals. 474-492
- Hauke Schramm, Xavier L. Aubert, Bart Bakker, Carsten Meyer, Hermann Ney:
Modeling spontaneous speech variability in professional dictation. 493-515
- Atsushi Fujii, Katunobu Itou, Tetsuya Ishikawa:
LODEM: A system for on-demand video lectures. 516-531
- Carsten Meyer, Hauke Schramm:
Boosting HMM acoustic models in large vocabulary speech recognition. 532-548
- Mark D. Skowronski, John G. Harris:
Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments. 549-558
- Diane J. Litman, Katherine Forbes-Riley:
Recognizing student emotions and attitudes on the basis of utterances in spoken tutoring dialogues with both human and computer tutors. 559-590
Volume 48, Number 6, June 2006
- SungHee Kim, Robert D. Frisina, Frances M. Mapes, Elizabeth D. Hickman, D. Robert Frisina:
Effect of age on binaural speech intelligibility in normal hearing adults. 591-597
- Praveen K. Kakumanu, Anna Esposito, Oscar N. Garcia, Ricardo Gutierrez-Osuna:
A comparison of acoustic coding models for speech-driven facial animation. 598-615
- Tong Zhang, Mark Hasegawa-Johnson, Stephen E. Levinson:
Cognitive state classification in a spoken tutorial dialogue system. 616-632
- Cynthia G. Clopper, David B. Pisoni:
The Nationwide Speech Project: A new corpus of American English dialects. 633-644
- Daniel Recasens, Aina Espinosa:
Dispersion and variability of Catalan vowels. 645-666
- Amalia Arvaniti, D. Robert Ladd, Ineke Mennen:
Phonetic effects of focus and "tonal crowding" in intonation: Evidence from Greek polar questions. 667-696
- Ben Milner, Xu Shao:
Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end. 697-715
- Min Chu, Yong Zhao, Eric Chang:
Modeling stylized invariance and local variability of prosody in text-to-speech synthesis. 716-726
- Leigh D. Alsteris, Kuldip K. Paliwal:
Further intelligibility results from human listening tests using the short-time phase spectrum. 727-736
- Junho Park, Hanseok Ko:
Achieving a reliable compact acoustic model for embedded speech recognition system with high confusion frequency model handling. 737-745
- Stephen So, Kuldip K. Paliwal:
Scalable distributed speech recognition using Gaussian mixture model-based block quantisation. 746-758
Volume 48, Number 7, July 2006
- Frédéric Bimbot, Marcos Faúndez-Zanuy, Renato de Mori:
Editorial. 759
- Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson:
Sub-banded reconstructed phase spaces for speech recognition. 760-774
- Erhard Rank, Gernot Kubin:
An oscillator-plus-noise model for speech synthesis. 775-801
- Giampiero Salvi:
Dynamic behaviour of connectionist speech recognition with strong latency constraints. 802-818
- Dimitrios Dimitriadis, Petros Maragos:
Continuous energy demodulation methods and application to speech analysis. 819-837
- Marcos Faúndez-Zanuy:
Speech coding through adaptive combined nonlinear prediction. 838-847
- Laurent Benaroya, Frédéric Bimbot, Guillaume Gravier, Rémi Gribonval:
Experiments in audio source separation with one sensor for robust speech recognition. 848-854
- SungHee Kim, Robert D. Frisina, D. Robert Frisina:
Effects of age on speech understanding in normal hearing listeners: Relationship between the auditory efferent system and speech intelligibility in noise. 855-862
Volume 48, Number 8, August 2006
- Luis Fernando D'Haro, Ricardo de Córdoba, Javier Ferreiros, Stefan W. Hamerich, Volker Schless, Basilis Kladis, Volker Schubert, Otilia Kocsis, Stefan Igel, José Manuel Pardo:
An advanced platform to speed up the design of multilingual dialog applications for multiple modalities. 863-887
- Naveen Srinivasamurthy, Antonio Ortega, Shrikanth S. Narayanan:
Efficient scalable encoding for distributed speech recognition. 888-902
- Mohammad Ali Salmani-Nodoushan:
A comparative sociopragmatic study of ostensible invitations in English and Farsi. 903-912
- T. Nagarajan, Hema A. Murthy:
Language identification using acoustic log-likelihoods of syllable-like units. 913-926
- Yasser Ghanbari, Mohammad Reza Karami-Mollaei:
A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets. 927-940
- Francisco Campillo Díaz, Eduardo Rodríguez Banga:
A method for combining intonation modelling and speech unit selection in corpus-based speech synthesis systems. 941-956
- Jean-Baptiste Maj, Liesbeth Royackers, Jan Wouters, Marc Moonen:
Comparison of adaptive noise reduction algorithms in dual microphone hearing aids. 957-970
- Roberto Togneri, Li Deng:
A state-space model with neural-network prediction for recovering vocal tract resonances in fluent speech from Mel-cepstral coefficients. 971-988
- Jinfu Ni, Keikichi Hirose:
Quantitative and structural modeling of voice fundamental frequency contours of speech in Mandarin. 989-1008
- Pushkar Patwardhan, Preeti Rao:
Effect of voice quality on frequency-warped modeling of vowel spectra. 1009-1023
- Adam Borowicz, Marek Parfieniuk, Alexander A. Petrovsky:
An application of the warped discrete Fourier transform in the perceptual speech enhancement. 1024-1036
- Jan Stadermann, Gerhard Rigoll:
Hybrid NN/HMM acoustic modeling techniques for distributed speech recognition. 1037-1046
- Ismail Shahin:
Enhancing speaker identification performance under the shouted talking condition using second-order circular hidden Markov models. 1047-1055
Volume 48, Number 9, September 2006
- Gerasimos Xydas, Georgios Kouroupetroglou:
Tone-Group F0 selection for modeling focus prominence in small-footprint speech synthesis. 1057-1078
- Felicia Roberts, Alexander L. Francis, Melanie Morgan:
The interaction of inter-turn silence with prosodic cues in listener perceptions of "trouble" in conversation. 1079-1093
- Fatih Ögüt, Mehmet Akif Kiliç, Erkan Zeki Engin, Rasit Midilli:
Voice onset times for Turkish stop consonants. 1094-1099
- Akira Sasou, Futoshi Asano, Satoshi Nakamura, Kazuyo Tanaka:
HMM-based noise-robust feature compensation. 1100-1111
- Alejandro Bassi, Néstor Becerra Yoma, Patricio Loncomilla:
Estimating tonal prosodic discontinuities in Spanish using HMM. 1112-1125
- Abhinav Sethy, Shrikanth S. Narayanan, S. Parthasarthy:
A split lexicon approach for improved recognition of spoken names. 1126-1136
- Teruhisa Misu, Tatsuya Kawahara:
Dialogue strategy to clarify user's queries for document retrieval system with speech interface. 1137-1150
- Makoto Hirohata, Yosuke Shinnaka, Koji Iwano, Sadaoki Furui:
Sentence-extractive automatic speech summarization and evaluation techniques. 1151-1161
- Dimitrios Ververidis, Constantine Kotropoulos:
Emotional speech recognition: Resources, features, and methods. 1162-1181
- Vivek Tyagi, Hervé Bourlard, Christian Wellekens:
On variable-scale piecewise stationary spectral analysis of speech signals for ASR. 1182-1191
- Joe Frankel, Simon King:
Observation process adaptation for linear dynamic models. 1192-1199
- Mohamed Faouzi BenZeghiba, Hervé Bourlard:
User-customized password speaker verification using multiple reference and background models. 1200-1213
- Dong Yu, Li Deng, Alex Acero:
A lattice search technique for a long-contextual-span hidden trajectory model of speech. 1214-1226
Volume 48, Number 10, October 2006
- Javier Latorre, Koji Iwano, Sadaoki Furui:
New approach to the polyglot speech generation by means of an HMM-based speaker adaptable synthesizer. 1227-1242
- S. R. Mahadeva Prasanna, Cheedella S. Gupta, B. Yegnanarayana:
Extraction of speaker-specific excitation information from linear prediction residual of speech. 1243-1261
- Özgül Salor, Mübeccel Demirekler:
Dynamic programming approach to voice transformation. 1262-1272
- Zhenyu Xiong, Thomas Fang Zheng, Zhanjiang Song, Frank K. Soong, Wenhu Wu:
A tree-based kernel selection approach to efficient Gaussian mixture model-universal background model based speaker identification. 1273-1282
- Gengxin Ning, Shu-hung Leung, Kam-keung Chu, Gang Wei:
A dynamic parameter compensation method for noisy speech recognition. 1283-1293
- Zekeriya Tufekci, John N. Gowdy, Sabri Gurbuz, Eric K. Patterson:
Applied mel-frequency discrete wavelet coefficients and parallel model compensation for noise-robust speech recognition. 1294-1307
- Rupal Patel, Maria I. Grigos:
Acoustic characterization of the question-statement contrast in 4, 7 and 11 year-old children. 1308-1318
- Sacha Krstulovic, Frédéric Bimbot, Olivier Boëffard, Delphine Charlet, Dominique Fohr, Odile Mella:
Optimizing the coverage of a speech database through a selection of representative speaker recordings. 1319-1348
- Wen Jin, Michael S. Scordilis:
Speech enhancement by residual domain constrained optimization. 1349-1364
- Abdellah Kacha, Francis Grenez, Jean Schoentgen:
Estimation of dysperiodicities in disordered speech. 1365-1378
- Jesús Vicente-Peña, Ascensión Gallardo-Antolín, Carmen Peláez-Moreno, Fernando Díaz-de-María:
Band-pass filtering of the time sequences of spectral parameters for robust wireless speech recognition. 1379-1398
Volume 48, Number 11, November 2006
- Ben Milner, Christian Wellekens, Børge Lindberg:
Special Issue on Robustness Issues for Conversational Interaction. 1399-1401
- Alastair Bruce James, Ben Milner:
Towards improving the robustness of distributed speech recognition in packet loss. 1402-1421
- Antonio Cardenal López, Carmen García-Mateo, Laura Docío Fernández:
Weighted Viterbi decoding strategies for distributed speech recognition over IP networks. 1422-1434
- Valentin Ion, Reinhold Haeb-Umbach:
Uncertainty decoding for distributed speech recognition over error-prone networks. 1435-1446
- Kentaro Ishizuka, Tomohiro Nakatani:
A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition. 1447-1457
- Benjamin J. Shannon, Kuldip K. Paliwal:
Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition. 1458-1485
- Soundararajan Srinivasan, Nicoleta Roman, DeLiang L. Wang:
Binary and ratio time-frequency masks for robust speech recognition. 1486-1501
- Veronique Stouten, Hugo Van hamme, Patrick Wambacq:
Model-based feature enhancement with uncertainty decoding for noise robust ASR. 1502-1514
- Tran Huy Dat, Kazuya Takeda, Fumitada Itakura:
On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement. 1515-1527
- Vivek Tyagi, Christian Wellekens, Dirk T. M. Slock:
Least squares filtering of speech signals for robust ASR. 1528-1544
- Esfandiar Zavarehei, Saeed Vaseghi, Qin Yan:
Inter-frame modeling of DFT trajectories of speech and noise for speech enhancement using Kalman filters. 1545-1555
- Jonathan Darch, Ben P. Milner, Saeed Vaseghi:
MAP prediction of formant frequencies and voicing class from MFCC vectors in noise. 1556-1572
- Richard C. Rose, Iker Arizmendi:
Efficient client-server based implementations of mobile speech recognition services. 1573-1589
- Frederik Stouten, Jacques Duchateau, Jean-Pierre Martens, Patrick Wambacq:
Coping with disfluencies in spontaneous speech recognition: Acoustic detection and linguistic context manipulation. 1590-1606
Volume 48, Number 12, December 2006
- Marcos Faúndez-Zanuy, Léonard Janer-García, Josep Roure Alcobé, Frédéric Bimbot, Renato de Mori:
Editorial. 1607
- Marcos Faúndez-Zanuy, Martin Hagmüller, Gernot Kubin:
Speaker verification security improvement by means of speech watermarking. 1608-1619
- Mohammed Bahoura, Jean Rouat:
Wavelet speech enhancement based on time-scale adaptation. 1620-1637
- Juan Manuel Górriz, Javier Ramírez, Elmar Wolfgang Lang, Carlos García Puntonet:
Hard C-means clustering for voice activity detection. 1638-1649
- Martin Hagmüller, Gernot Kubin:
Poincaré pitch marks. 1650-1665
- Giampiero Salvi:
Segment boundary detection via class entropy measurements in connectionist phoneme recognition. 1666-1676
- Sadao Hiroya, Takemi Mochida:
Multi-speaker articulatory trajectory formation based on speaker-independent articulatory HMMs. 1677-1690
- Anna Pribilová, Jiri Pribil:
Non-linear frequency scale mapping for voice conversion in text-to-speech system with cepstral description. 1691-1703
- Peter J. Murphy:
Periodicity estimation in synthesized phonation signals using cepstral rahmonic peaks. 1704-1713