default search action
SpeD 2023: Bucharest, Romania
- International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2023, Bucharest, Romania, October 25-27, 2023. IEEE 2023, ISBN 979-8-3503-2797-7
- Cristian-Teodor Neamtu, Serban Mihalache, Dragos Burileanu:
Liveness Detection - Automatic Classification of Spontaneous and Pre-recorded Speech for Biometric Applications. 1-5 - Doina Jitca:
How F0 Contours Reflect Speech Nuclear Events. 6-11 - Marian Negru, Bogdan Morosanu, Ana Neacsu, Dragos Draghicescu, Cristian Negrescu:
Automatic Audio Upmixing Based on Source Separation and Ambient Extraction Algorithms. 12-17 - Cristian Lucian Stanciu, Cristian Anghel, Camelia Elisei-Iliescu:
Regularized RLS Adaptive Algorithm with Conjugate Gradient Method. 18-23 - Bogdan Morosanu, Marian Negru, Ana Neacsu, Cristian Negrescu, Constantin Paleologu:
Personalized Multi-Track Leveling Algorithm. 24-29 - Tudor-Vasile Serban-Moga, Lacrimioara Grama, Corneliu Rusu:
Classification and Identification of Certain Types of Car Accidents Based on Sound Information. 30-35 - Muhammad Ali Farooq, Dan Bigioi, Rishabh Jain, Wang Yao, Mariam Yahayah Yiwere, Peter Corcoran:
Synthetic Speaking Children - Why We Need Them and How to Make Them. 36-41 - Andrei Barcovschi, Rishabh Jain, Peter Corcoran:
A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition. 42-47 - Shaimaa Alwaisi, Mohammed Salah Al-Radhi, Géza Németh:
Automated Child Voice Generation: Methodology and Implementation. 48-53 - Rishabh Jain, Peter Corcoran:
Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer Learning. 54-59 - Cathlyn Abion, Niel Carlo Lumapag, John C. Ramirez, Christchelle Resulto, Crisron Rudolf Lucas:
Comparison of Data Augmentation Techniques on Filipino ASR for Children's Speech. 60-65 - Gabriel Cosache, Francisco Salgado, Rishabh Jain, Cosmin Rotariu, George Sterpu, Peter Corcoran:
Data Center Audio/Video Intelligence on Device (DAVID) - An Edge-AI Platform for Smart-Toys. 66-71 - Mike H. M. Teodorescu, Ionut Taraboanta, Sorin Andrian, Cristina Angela Ghiorghe, Horia-Nicolai Teodorescu, Mihaela Grigorie:
Fairness Constraints in Speech-to-Text Applications - Clinical Effects of Salivation. 72-77 - Seyed Reza Shahamiri, Krishnendu Mandal, Sudeshna Sarkar:
Dysarthric Speech Recognition using Depthwise Separable Convolutions: Preliminary Study. 78-82 - Horia-Nicolai Teodorescu, Stefan-Andrei Gheltu:
Analysis of Formant Correlations in Emotional Speech - Do Formants Correlate? 83-88 - Dhvani Shah, Vanshika Lal, Zihan Zhong, Qianli Wang, Seyed Reza Shahamiri:
Dysarthric Speech Recognition: A Comparative Study. 89-94 - Jan Malucha:
Investigation of a Specific Effect of Alcohol on Formants. 95-99 - Adrian Bogdan Stânea, Vlad Striletchi, Cosmin Striletchi, Adriana Cornelia Stan:
An analysis of large speech models-based representations for speech emotion recognition. 100-104 - Bogdan Marghescu, Stefan-Adrian Toma, Luciana Morogan, Ion Bica:
Speech Emotion Recognition for Emergency Services. 105-110 - Alexandru Vulpe, Marius Zamfirache, Alexandru Caranica:
Analysis of Spectral Entropy and Maximum Power of EEG as Authentication Mechanisms. 111-115 - Cristian Manolache, Cristina Andronache, Alexandru Caranica, Horia Cucu, Andi Buzo, Cristian Diaconu, Georg Pelz:
Applying Multi-objective Acquisition Function Ensemble for a candidate proposal algorithm. 116-121 - Maria-Madalina Andronache, Alexandru Vulpe:
Experimental Analysis of Network Traffic Databases for Anomaly Detection. 122-127 - Georgian Nicolae, Catalin Visan, Dan Curavale, Mihai Boldeanu, Horia Cucu, Andi Buzo, Georg Pelz:
A Study on Initial Population Sampling for Multi-Objective Optimization based on Differential Evolution and Bayesian Inference. 128-132 - Silviu Ioan Bejinariu, Vasile Apopei, Manuela Nevaci, Florin-Teodor Olariu, Nicolae Saramandu:
Information Technology and Geolinguistics. 133-140 - Vasile Pais, Verginica Barbu Mititelu, Radu Ion, Elena Irimia:
Evaluating a Fine-Tuned Whisper Model on Underrepresented Romanian Speech. 141-145 - Márcio Fuckner, Sophie Horsman, Pascal Wiggers, Iskaj Janssen:
Uncovering Bias in ASR Systems: Evaluating Wav2vec2 and Whisper for Dutch speakers. 146-151 - Mohammed Salah Al-Radhi, Omnia Ibrahim, Ali Raheem Mandeel, Tamás Gábor Csapó, Géza Németh:
Advancing Limited Data Text-to-Speech Synthesis: Non-Autoregressive Transformer for High-Quality Parallel Synthesis. 152-157 - Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó:
Enhancing End-to-End Speech Synthesis by Modeling Interrogative Sentences with Speaker Adaptation. 158-163 - Camille Marie Tatoy, Johanna Lindsey Pasco, Josaphat Ira Emmanuel Benedicto, Crisron Rudolf Lucas:
Harmonic-plus-Noise Network with Linear Prediction and Perceptual Weighting Filters for Filipino Speech Synthesis. 164-169 - Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó:
Modeling Irregular Voice in End-to-End Speech Synthesis via Speaker Adaptation. 170-175 - Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
Nonparallel Expressive TTS for Unseen Target Speaker using Style-Controlled Adaptive Layer and Optimized Pitch Embedding. 176-181
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.