default search action
Kentaro Tachibana
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c20]Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana:
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions. ICASSP 2024: 12672-12676 - [i16]Yuki Saito, Takuto Igarashi, Kentaro Seki, Shinnosuke Takamichi, Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari:
SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark. CoRR abs/2406.07254 (2024) - [i15]Takuto Igarashi, Yuki Saito, Kentaro Seki, Shinnosuke Takamichi, Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari:
Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment. CoRR abs/2406.07280 (2024) - [i14]Masaya Kawamura, Ryuichi Yamamoto, Yuma Shirahata, Takuya Hasumi, Kentaro Tachibana:
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning. CoRR abs/2406.07969 (2024) - [i13]Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana:
Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control. CoRR abs/2409.17452 (2024) - 2023
- [c19]Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana:
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform. ICASSP 2023: 1-5 - [c18]Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana:
Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis. ICASSP 2023: 1-5 - [c17]Reo Yoneyama, Ryuichi Yamamoto, Kentaro Tachibana:
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs. ICASSP 2023: 1-5 - [c16]Yuki Saito, Shinnosuke Takamichi, Eiji Iimori, Kentaro Tachibana, Hiroshi Saruwatari:
ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings. INTERSPEECH 2023: 3048-3052 - [c15]Yuki Saito, Eiji Iimori, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari:
CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center. INTERSPEECH 2023: 5561-5565 - [i12]Yuki Saito, Eiji Iimori, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari:
CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center. CoRR abs/2305.13713 (2023) - [i11]Yuki Saito, Shinnosuke Takamichi, Eiji Iimori, Kentaro Tachibana, Hiroshi Saruwatari:
ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings. CoRR abs/2305.13724 (2023) - [i10]Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana:
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions. CoRR abs/2309.08140 (2023) - 2022
- [c14]Takaaki Saeki, Kentaro Tachibana, Ryuichi Yamamoto:
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning. INTERSPEECH 2022: 793-797 - [c13]Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana:
A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech. INTERSPEECH 2022: 1931-1935 - [c12]Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana:
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. INTERSPEECH 2022: 3018-3022 - [c11]Yuto Nishimura, Yuki Saito, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari:
Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History. INTERSPEECH 2022: 3373-3377 - [c10]Yuki Saito, Yuto Nishimura, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari:
STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent. INTERSPEECH 2022: 5155-5159 - [i9]Yuki Saito, Yuto Nishimura, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari:
STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent. CoRR abs/2203.14757 (2022) - [i8]Takaaki Saeki, Kentaro Tachibana, Ryuichi Yamamoto:
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning. CoRR abs/2203.15683 (2022) - [i7]Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana:
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. CoRR abs/2204.10020 (2022) - [i6]Yuto Nishimura, Yuki Saito, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari:
Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History. CoRR abs/2206.08039 (2022) - [i5]Reo Yoneyama, Ryuichi Yamamoto, Kentaro Tachibana:
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs. CoRR abs/2210.15887 (2022) - [i4]Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana:
Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis. CoRR abs/2210.15964 (2022) - [i3]Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana:
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform. CoRR abs/2210.15975 (2022) - 2021
- [c9]Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana:
Phrase Break Prediction with Bidirectional Encoder Representations in Japanese Text-to-Speech Synthesis. Interspeech 2021: 3126-3130 - [i2]Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana:
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis. CoRR abs/2104.12395 (2021) - 2020
- [j1]Yuki Saito, Kei Akuzawa, Kentaro Tachibana:
Joint Adversarial Training of Speech Recognition and Synthesis Models for Many-to-One Voice Conversion Using Phonetic Posteriorgrams. IEICE Trans. Inf. Syst. 103-D(9): 1978-1987 (2020) - [c8]Shunsuke Goto, Kotaro Onishi, Yuki Saito, Kentaro Tachibana, Koichiro Mori:
Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image. INTERSPEECH 2020: 1321-1325 - [p1]Yoshinori Shiga, Jinfu Ni, Kentaro Tachibana, Takuma Okamoto:
Text-to-Speech Synthesis. Speech-to-Speech Translation 2020: 39-52
2010 – 2019
- 2018
- [c7]Koichi Hamada, Kentaro Tachibana, Tianqi Li, Hiroto Honda, Yusuke Uchida:
Full-Body High-Resolution Anime Generation with Progressive Structure-Conditional Generative Adversarial Networks. ECCV Workshops (3) 2018: 67-74 - [c6]Takuma Okamoto, Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
An Investigation of Subband Wavenet Vocoder Covering Entire Audible Frequency Range with Limited Acoustic Features. ICASSP 2018: 5654-5658 - [c5]Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
An Investigation of Noise Shaping with Perceptual Weighting for Wavenet-Based Speech Generation. ICASSP 2018: 5664-5668 - [i1]Koichi Hamada, Kentaro Tachibana, Tianqi Li, Hiroto Honda, Yusuke Uchida:
Full-body High-resolution Anime Generation with Progressive Structure-conditional Generative Adversarial Networks. CoRR abs/1809.01890 (2018) - 2017
- [c4]Takuma Okamoto, Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Subband wavenet with overlapped single-sideband filterbanks. ASRU 2017: 698-704 - 2016
- [c3]Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework. INTERSPEECH 2016: 2288-2292
2000 – 2009
- 2009
- [c2]Yu Takahashi, Hiroshi Saruwatari, Yuki Fujihara, Kentaro Tachibana, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka:
Source adaptive blind signal extraction using closed-form ICA for hands-free robot spoken dialogue system. ICASSP 2009: 3681-3684 - 2007
- [c1]Kentaro Tachibana, Hiroshi Saruwatari, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka:
Efficient Blind Source Separation Combining Closed-Form Second-Order ICA and Nonclosed-Form Higher-Order ICA. ICASSP (1) 2007: 45-48
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-22 20:16 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint