default search action
Akira Sasou
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c34]Bagus Tris Atmaja, Akira Sasou:
Multilingual, Cross-lingual, and Monolingual Speech Emotion Recognition on EmoFilm Dataset. APSIPA ASC 2023: 1019-1025 - [c33]Bagus Tris Atmaja, Akira Sasou:
Ensembling Multilingual Pre-Trained Models for Predicting Multi-Label Regression Emotion Share from Speech. APSIPA ASC 2023: 1026-1029 - [c32]Bagus Tris Atmaja, Akira Sasou:
Evaluating Variants of wav2vec 2.0 on Affective Vocal Burst Tasks. ICASSP 2023: 1-5 - [c31]Akira Sasou, Yang Chen:
Comparison of GIF- and SSL-based Features in Pathological-voice Detection. INTERSPEECH 2023: 2893-2897 - [c30]Akira Sasou, Satoki Ogiso, Akihiko Nagakubo:
Deep Extreme Learning Machine With its Application to Body-Conducted-Sound-Based Handwork Recognition. MLSP 2023: 1-6 - [i3]Bagus Tris Atmaja, Akira Sasou:
Ensembling Multilingual Pre-Trained Models for Predicting Multi-Label Regression Emotion Share from Speech. CoRR abs/2309.11014 (2023) - 2022
- [j12]Bagus Tris Atmaja, Akira Sasou, Masato Akagi:
Speech Emotion and Naturalness Recognitions With Multitask and Single-Task Learnings. IEEE Access 10: 72381-72387 (2022) - [j11]Bagus Tris Atmaja, Akira Sasou:
Evaluating Self-Supervised Speech Representations for Speech Emotion Recognition. IEEE Access 10: 124396-124407 (2022) - [j10]Bagus Tris Atmaja, Akira Sasou:
Effects of Data Augmentations on Speech Emotion Recognition. Sensors 22(16): 5941 (2022) - [j9]Bagus Tris Atmaja, Akira Sasou:
Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations. Sensors 22(17): 6369 (2022) - [j8]Bagus Tris Atmaja, Akira Sasou, Masato Akagi:
Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion. Speech Commun. 140: 11-28 (2022) - [c29]Bagus Tris Atmaja, Zanjabila, Akira Sasou:
Jointly Predicting Emotion, Age, and Country Using Pre-Trained Acoustic Embedding. ACIIW 2022: 1-6 - [i2]Bagus Tris Atmaja, Zanjabila, Suyanto, Akira Sasou:
Cross-dataset COVID-19 Transfer Learning with Cough Detection, Cough Segmentation, and Data Augmentation. CoRR abs/2210.05843 (2022) - [i1]Bagus Tris Atmaja, Akira Sasou:
Effect of different splitting criteria on the performance of speech emotion recognition. CoRR abs/2210.14501 (2022) - 2021
- [j7]Akira Sasou:
Deep Residual Learning With Dilated Causal Convolution Extreme Learning Machine. IEEE Access 9: 165708-165718 (2021) - [c28]Bagus Tris Atmaja, Akira Sasou, Masato Akagi:
Automatic Naturalness Recognition from Acted Speech Using Neural Networks. APSIPA ASC 2021: 731-736 - [c27]Bagus Tris Atmaja, Akira Sasou:
Effect of different splitting criteria on the performance of speech emotion recognition. TENCON 2021: 760-764
2010 – 2019
- 2019
- [c26]Shumpei Matsuoka, Yao Jiang, Akira Sasou:
Generation of Artificial FO-contours of Emotional Speech with Generative Adversarial Networks. SSCI 2019: 1030-1034 - 2018
- [j6]Akira Sasou:
Glottal inverse filtering by combining a constrained LP and an HMM-based generative model of glottal flow derivative. Speech Commun. 104: 113-128 (2018) - [c25]Akira Sasou, Nyamerdene Odontsengel, Shumpei Matsuoka:
An Acoustic-based Tracking System for Monitoring Elderly People Living Alone. ICT4AWE 2018: 89-95 - 2017
- [c24]Akira Sasou:
Automatic identification of pathological voice quality based on the GRBAS categorization. APSIPA 2017: 1243-1247 - 2016
- [c23]Akira Sasou:
Voice-pathology analysis based on AR-HMM. APSIPA 2016: 1-4 - 2014
- [c22]Akira Sasou:
Accuracy evaluation of esophageal voice analysis based on automatic topology generated-voicing source HMM. INTERSPEECH 2014: 1381-1385 - 2013
- [c21]Akira Sasou:
Evaluation of fundamental validity in applying AR-HMM with automatic topology generation to pathology voice analysis. INTERSPEECH 2013: 1673-1676 - 2012
- [c20]Akira Sasou:
Automatic Topology Generation of Glottal Source HMM. INTERSPEECH 2012: 1616-1619 - [c19]Akira Sasou, Nyamerdene Odontsengel:
Acoustic novelty detection based on AHLAC and NMF. ISPACS 2012: 872-875 - 2011
- [c18]Akira Sasou:
Powered Wheelchair Control Using Acoustic-Based Recognition of Head Gesture Accompanying Speech. INTERSPEECH 2011: 3029-3032 - [c17]Akira Sasou:
Acoustic surveillance based on Higher-order Local Auto-Correlation. MLSP 2011: 1-5 - 2010
- [c16]Akira Sasou, Yasuharu Hashimoto, Katsuhiko Sakaue:
Acoustic head gesture recognition and its applications. AVSP 2010: 3 - [c15]Akira Sasou, Yasuharu Hashimoto, Katsuhiko Sakaue:
Acoustic-based recognition of head gestures accompanying speech. INTERSPEECH 2010: 506-509
2000 – 2009
- 2009
- [j5]Akira Sasou, Hiroaki Kojima:
Noise Robust Speech Recognition Applied to Voice-Driven Wheelchair. EURASIP J. Adv. Signal Process. 2009 (2009) - [c14]Yasuharu Hashimoto, Akira Sasou:
Development of a 3D pointing voice interface using a three-axis microphone array. RO-MAN 2009: 1186-1191 - [c13]Akira Sasou:
Acoustic head orientation estimation applied to powered wheelchair control. ROBOCOMM 2009: 1-6 - 2008
- [c12]Akira Sasou:
Head-Orientation-Estimation-Integrated Speech Recognition for the Smart-Chair. ISUC 2008: 482-489 - 2007
- [c11]Akira Sasou, Hiroaki Kojima:
Noise robust speech recognition for voice driven wheelchair. INTERSPEECH 2007: 250-253 - 2006
- [j4]Akira Sasou, Futoshi Asano, Satoshi Nakamura, Kazuyo Tanaka:
HMM-based noise-robust feature compensation. Speech Commun. 48(9): 1100-1111 (2006) - [c10]Akira Sasou:
Singing voice recognition considering high-pitched and prolonged sounds. EUSIPCO 2006: 1-4 - 2005
- [j3]Satoshi Nakamura, Kazuya Takeda, Kazumasa Yamamoto, Takeshi Yamada, Shingo Kuroiwa, Norihide Kitaoka, Takanobu Nishiura, Akira Sasou, Mitsunori Mizumachi, Chiyomi Miyajima, Masakiyo Fujimoto, Toshiki Endo:
AURORA-2J: An Evaluation Framework for Japanese Noisy Speech Recognition. IEICE Trans. Inf. Syst. 88-D(3): 535-544 (2005) - [c9]Akira Sasou, Masataka Goto, Satoru Hayamizu, Kazuyo Tanaka:
An Auto-Regressive, Non-Stationary Excited Signal Parameter Estimation Method and an Evaluation of a Singing-Voice Recognition. ICASSP (1) 2005: 237-240 - [c8]Masakiyo Fujimoto, Satoshi Nakamura, Toshiki Endo, Kazuya Takeda, Chiyomi Miyajima, Shingo Kuroiwa, Takeshi Yamada, Norihide Kitaoka, Kazumasa Yamamoto, Mitsunori Mizumachi, Takanobu Nishiura, Akira Sasou:
CENSREC-3: Data Collection for In-Car Speech Recognition and Its Common Evaluation Framework. ICDE Workshops 2005: 1208 - 2004
- [c7]Akira Sasou, Kazuyo Tanaka, Satoshi Nakamura, Futoshi Asano:
HMM-based feature compensation method: an evaluation using the AURORA2. INTERSPEECH 2004: 121-124 - 2003
- [j2]Akira Sasou, Kazuyo Tanaka:
A waveform generation model-based approach for segregation of monaural mixed sound. Signal Process. 83(3): 561-574 (2003) - [c6]Akira Sasou, Futoshi Asano, Kazuyo Tanaka, Satoshi Nakamura:
Adaptation of acoustic model using the gain-adapted HMM decomposition method. INTERSPEECH 2003: 29-32 - 2002
- [c5]Akira Sasou, Kazuyo Tanaka:
A waveform generation model based approach for segregation of monaural mixture sound. EUSIPCO 2002: 1-4 - [c4]Akira Sasou, Kazuyo Tanaka:
Adaptive estimation of time-varying features from high-pitched speech based on an excitation source HMM. INTERSPEECH 2002: 2161-2164 - 2001
- [c3]Akira Sasou, Kazuyo Tanaka:
Robust LP analysis using glottal source HMM with application to high-pitched and noise corrupted speech. INTERSPEECH 2001: 2443-2446 - 2000
- [c2]Hiroshi Ohmura, Akira Sasou, Kazuyo Tanaka:
A low bit rate speech coding method using a formant-articulatory parameter nomogram. INTERSPEECH 2000: 202-205 - [c1]Akira Sasou, Kazuyo Tanaka:
Glottal excitation modeling using HMM with application to robust analysis of speech signal. INTERSPEECH 2000: 704-707
1990 – 1999
- 1996
- [j1]Hiroto Saito, Isao Umoto, Akira Sasou, Shogo Nakamura, Yoshihiko Horio, Tahiro Kubota:
Subadaptive piecewise linear quantization for speech signal (64 kbit/s) compression. IEEE Trans. Speech Audio Process. 4(5): 379-382 (1996)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-10 20:48 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint