default search action

combined dblp search
author search
venue search
publication search

ask others

Kentaro Tachibana

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShimizuYKSDKT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShimizuYKSDKT24
Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana:
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions. ICASSP 2024: 12672-12676
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07254
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07254
Yuki Saito, Takuto Igarashi, Kentaro Seki, Shinnosuke Takamichi, Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari:
SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark. CoRR abs/2406.07254 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07280
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07280
Takuto Igarashi, Yuki Saito, Kentaro Seki, Shinnosuke Takamichi, Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari:
Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment. CoRR abs/2406.07280 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07969
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07969
Masaya Kawamura, Ryuichi Yamamoto, Yuma Shirahata, Takuya Hasumi, Kentaro Tachibana:
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning. CoRR abs/2406.07969 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-17452
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-17452
Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana:
Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control. CoRR abs/2409.17452 (2024)
2023
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KawamuraSYT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KawamuraSYT23
Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana:
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform. ICASSP 2023: 1-5
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShirahataYSTKT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShirahataYSTKT23
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana:
Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis. ICASSP 2023: 1-5
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YoneyamaYT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YoneyamaYT23
Reo Yoneyama, Ryuichi Yamamoto, Kentaro Tachibana:
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs. ICASSP 2023: 1-5
[c16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaitoTITS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaitoTITS23
Yuki Saito, Shinnosuke Takamichi, Eiji Iimori, Kentaro Tachibana, Hiroshi Saruwatari:
ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings. INTERSPEECH 2023: 3048-3052
[c15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaitoITTS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaitoITTS23
Yuki Saito, Eiji Iimori, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari:
CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center. INTERSPEECH 2023: 5561-5565
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13713
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13713
Yuki Saito, Eiji Iimori, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari:
CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center. CoRR abs/2305.13713 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13724
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13724
Yuki Saito, Shinnosuke Takamichi, Eiji Iimori, Kentaro Tachibana, Hiroshi Saruwatari:
ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings. CoRR abs/2305.13724 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-08140
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08140
Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana:
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions. CoRR abs/2309.08140 (2023)
2022
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaekiTY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaekiTY22
Takaaki Saeki, Kentaro Tachibana, Ryuichi Yamamoto:
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning. INTERSPEECH 2022: 793-797
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ParkYT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ParkYT22
Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana:
A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech. INTERSPEECH 2022: 1931-1935
[c12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TerashimaYSSYKT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TerashimaYSSYKT22
Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana:
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. INTERSPEECH 2022: 3018-3022
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NishimuraSTTS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NishimuraSTTS22
Yuto Nishimura, Yuki Saito, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari:
Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History. INTERSPEECH 2022: 3373-3377
[c10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaitoNTTS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaitoNTTS22
Yuki Saito, Yuto Nishimura, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari:
STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent. INTERSPEECH 2022: 5155-5159
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-14757
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-14757
Yuki Saito, Yuto Nishimura, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari:
STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent. CoRR abs/2203.14757 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15683
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15683
Takaaki Saeki, Kentaro Tachibana, Ryuichi Yamamoto:
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning. CoRR abs/2203.15683 (2022)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-10020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-10020
Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana:
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. CoRR abs/2204.10020 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-08039
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-08039
Yuto Nishimura, Yuki Saito, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari:
Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History. CoRR abs/2206.08039 (2022)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15887
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15887
Reo Yoneyama, Ryuichi Yamamoto, Kentaro Tachibana:
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs. CoRR abs/2210.15887 (2022)
[i4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15964
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15964
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana:
Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis. CoRR abs/2210.15964 (2022)
[i3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15975
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15975
Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana:
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform. CoRR abs/2210.15975 (2022)
2021
[c9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FutamataPYT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FutamataPYT21
Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana:
Phrase Break Prediction with Bidirectional Encoder Representations in Japanese Text-to-Speech Synthesis. Interspeech 2021: 3126-3130
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-12395
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-12395
Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana:
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis. CoRR abs/2104.12395 (2021)
2020
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicetd/SaitoAT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicetd/SaitoAT20
Yuki Saito, Kei Akuzawa, Kentaro Tachibana:
Joint Adversarial Training of Speech Recognition and Synthesis Models for Many-to-One Voice Conversion Using Phonetic Posteriorgrams. IEICE Trans. Inf. Syst. 103-D(9): 1978-1987 (2020)
[c8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GotoOSTM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GotoOSTM20
Shunsuke Goto, Kotaro Onishi, Yuki Saito, Kentaro Tachibana, Koichiro Mori:
Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image. INTERSPEECH 2020: 1321-1325
[p1]
- view
  authority control:
- export record
  dblp key:
  - series/sbcs/ShigaNTO20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/series/sbcs/ShigaNTO20
Yoshinori Shiga, Jinfu Ni, Kentaro Tachibana, Takuma Okamoto:
Text-to-Speech Synthesis. Speech-to-Speech Translation 2020: 39-52

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2018
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/HamadaTLHU18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/HamadaTLHU18
Koichi Hamada, Kentaro Tachibana, Tianqi Li, Hiroto Honda, Yusuke Uchida:
Full-Body High-Resolution Anime Generation with Progressive Structure-Conditional Generative Adversarial Networks. ECCV Workshops (3) 2018: 67-74
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/OkamotoTTSK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/OkamotoTTSK18
Takuma Okamoto, Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
An Investigation of Subband Wavenet Vocoder Covering Entire Audible Frequency Range with Limited Acoustic Features. ICASSP 2018: 5654-5658
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TachibanaTSK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TachibanaTSK18
Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
An Investigation of Noise Shaping with Perceptual Weighting for Wavenet-Based Speech Generation. ICASSP 2018: 5664-5668
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1809-01890
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-01890
Koichi Hamada, Kentaro Tachibana, Tianqi Li, Hiroto Honda, Yusuke Uchida:
Full-body High-resolution Anime Generation with Progressive Structure-conditional Generative Adversarial Networks. CoRR abs/1809.01890 (2018)
2017
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/OkamotoTTSK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/OkamotoTTSK17
Takuma Okamoto, Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Subband wavenet with overlapped single-sideband filterbanks. ASRU 2017: 698-704
2016
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TachibanaTSK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TachibanaTSK16
Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework. INTERSPEECH 2016: 2288-2292

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TakahashiSFTMMST09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TakahashiSFTMMST09
Yu Takahashi, Hiroshi Saruwatari, Yuki Fujihara, Kentaro Tachibana, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka:
Source adaptive blind signal extraction using closed-form ICA for hands-free robot spoken dialogue system. ICASSP 2009: 3681-3684
2007
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TachibanaSMMST07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TachibanaSMMST07
Kentaro Tachibana, Hiroshi Saruwatari, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka:
Efficient Blind Source Separation Combining Closed-Form Second-Order ICA and Nonclosed-Form Higher-Order ICA. ICASSP (1) 2007: 45-48

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.