Yosuke Higuchi

2020 – today

2024
[c23]
https://dblp.org/rec/conf/icassp/ArigaHHOO24
Tomoki Ariga, Yosuke Higuchi, Kazutoshi Hayasaka, Naoki Okamoto, Tetsuji Ogawa:
Parody Detection Using Source-Target Attention with Teacher-Forced Lyrics. ICASSP 2024: 1151-1155
[i21]
https://dblp.org/rec/journals/corr/abs-2409-19990
Oswald Zink, Yosuke Higuchi, Carlos Mullov, Alexander Waibel, Tetsunori Kobayashi:
Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems. CoRR abs/2409.19990 (2024)
2023
[c22]
https://dblp.org/rec/conf/asru/HiguchiRWBR23
Yosuke Higuchi, Andrew Rosenberg, Yuan Wang, Murali Karthick Baskar, Bhuvana Ramabhadran:
Mask-Conformer: Augmenting Conformer with Mask-Predict Decoder. ASRU 2023: 1-8
[c21]
https://dblp.org/rec/conf/asru/SomekiEHW23
Masao Someki, Nicholas Eng, Yosuke Higuchi, Shinji Watanabe:
Segment-Level Vectorized Beam Search Based on Partially Autoregressive Inference. ASRU 2023: 1-8
[c20]
https://dblp.org/rec/conf/eacl/YanDHNMBW23
Brian Yan, Siddharth Dalmia, Yosuke Higuchi, Graham Neubig, Florian Metze, Alan W. Black, Shinji Watanabe:
CTC Alignments Improve Autoregressive Translation. EACL 2023: 1615-1631
[c19]
https://dblp.org/rec/conf/eusipco/ZhaoHKOK23
Huaibo Zhao, Yosuke Higuchi, Yusuke Kida, Tetsuji Ogawa, Tetsunori Kobayashi:
Mask-CTC-Based Encoder Pre-Training for Streaming End-to-End Speech Recognition. EUSIPCO 2023: 56-60
[c18]
https://dblp.org/rec/conf/eusipco/ArigaHKSMOO23
Tomoki Ariga, Yosuke Higuchi, Mitsunori Kanno, Rie Shigyo, Takato Mizuguchi, Naoki Okamoto, Tetsuji Ogawa:
Spotting Parodies: Detecting Alignment Collapse Between Lyrics and Singing Voice. EUSIPCO 2023: 286-290
[c17]
https://dblp.org/rec/conf/icassp/HiguchiOKW23
Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi, Shinji Watanabe:
InterMPL: Momentum Pseudo-Labeling with Intermediate CTC Loss. ICASSP 2023: 1-5
[c16]
https://dblp.org/rec/conf/icassp/HiguchiOKW23a
Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi, Shinji Watanabe:
BECTRA: Transducer-Based End-to-End ASR with BERT-Enhanced Encoder. ICASSP 2023: 1-5
[i20]
https://dblp.org/rec/journals/corr/abs-2309-04654
Huaibo Zhao, Yosuke Higuchi, Yusuke Kida, Tetsuji Ogawa, Tetsunori Kobayashi:
Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition. CoRR abs/2309.04654 (2023)
[i19]
https://dblp.org/rec/journals/corr/abs-2309-10524
Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi:
Harnessing the Zero-Shot Power of Instruction-Tuned Large Language Model in End-to-End Speech Recognition. CoRR abs/2309.10524 (2023)
[i18]
https://dblp.org/rec/journals/corr/abs-2309-14922
Masao Someki, Nicholas Eng, Yosuke Higuchi, Shinji Watanabe:
Segment-Level Vectorized Beam Search Based on Partially Autoregressive Inference. CoRR abs/2309.14922 (2023)
2022
[j1]
https://dblp.org/rec/journals/jstsp/HiguchiMRH22
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori:
Momentum Pseudo-Labeling: Semi-Supervised ASR With Continuously Improving Pseudo-Labels. IEEE J. Sel. Top. Signal Process. 16(6): 1424-1438 (2022)
[c15]
https://dblp.org/rec/conf/emnlp/HiguchiYAOK022
Yosuke Higuchi, Brian Yan, Siddhant Arora, Tetsuji Ogawa, Tetsunori Kobayashi, Shinji Watanabe:
BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model. EMNLP (Findings) 2022: 5486-5503
[c14]
https://dblp.org/rec/conf/icassp/HiguchiMRH22
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori:
Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy. ICASSP 2022: 7672-7676
[c13]
https://dblp.org/rec/conf/icassp/HiguchiKOK22
Yosuke Higuchi, Keita Karube, Tetsuji Ogawa, Tetsunori Kobayashi:
Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units. ICASSP 2022: 7797-7801
[c12]
https://dblp.org/rec/conf/icassp/DengYWHCZ22
Keqi Deng, Zehui Yang, Shinji Watanabe, Yosuke Higuchi, Gaofeng Cheng, Pengyuan Zhang:
Improving Non-Autoregressive End-to-End Speech Recognition with Pre-Trained Acoustic and Language Models. ICASSP 2022: 8522-8526
[c11]
https://dblp.org/rec/conf/slt/PengAHUKGDCW22
Yifan Peng, Siddhant Arora, Yosuke Higuchi, Yushi Ueda, Sujay Kumar, Karthik Ganesan, Siddharth Dalmia, Xuankai Chang, Shinji Watanabe:
A Study on the Integration of Pre-Trained SSL, ASR, LM and SLU Models for Spoken Language Understanding. SLT 2022: 406-413
[i17]
https://dblp.org/rec/journals/corr/abs-2201-10103
Keqi Deng, Zehui Yang, Shinji Watanabe, Yosuke Higuchi, Gaofeng Cheng, Pengyuan Zhang:
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models. CoRR abs/2201.10103 (2022)
[i16]
https://dblp.org/rec/journals/corr/abs-2210-05200
Brian Yan, Siddharth Dalmia, Yosuke Higuchi, Graham Neubig, Florian Metze, Alan W. Black, Shinji Watanabe:
CTC Alignments Improve Autoregressive Translation. CoRR abs/2210.05200 (2022)
[i15]
https://dblp.org/rec/journals/corr/abs-2210-16663
Yosuke Higuchi, Brian Yan, Siddhant Arora, Tetsuji Ogawa, Tetsunori Kobayashi, Shinji Watanabe:
BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model. CoRR abs/2210.16663 (2022)
[i14]
https://dblp.org/rec/journals/corr/abs-2211-00792
Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi, Shinji Watanabe:
BECTRA: Transducer-based End-to-End ASR with BERT-Enhanced Encoder. CoRR abs/2211.00792 (2022)
[i13]
https://dblp.org/rec/journals/corr/abs-2211-00795
Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi, Shinji Watanabe:
InterMPL: Momentum Pseudo-Labeling with Intermediate CTC Loss. CoRR abs/2211.00795 (2022)
[i12]
https://dblp.org/rec/journals/corr/abs-2211-05869
Yifan Peng, Siddhant Arora, Yosuke Higuchi, Yushi Ueda, Sujay Kumar, Karthik Ganesan, Siddharth Dalmia, Xuankai Chang, Shinji Watanabe:
A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding. CoRR abs/2211.05869 (2022)
2021
[c10]
https://dblp.org/rec/conf/apsipa/ZhaoHOK21
Huaibo Zhao, Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi:
An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASR. APSIPA ASC 2021: 477-483
[c9]
https://dblp.org/rec/conf/asru/HiguchiCFIKLNWW21
Yosuke Higuchi, Nanxin Chen, Yuya Fujita, Hirofumi Inaguma, Tatsuya Komatsu, Jaesong Lee, Jumon Nozaki, Tianzi Wang, Shinji Watanabe:
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation. ASRU 2021: 47-54
[c8]
https://dblp.org/rec/conf/icassp/GuoBCHHIKLGSSWW21
Pengcheng Guo, Florian Boyer, Xuankai Chang, Tomoki Hayashi, Yosuke Higuchi, Hirofumi Inaguma, Naoyuki Kamo, Chenda Li, Daniel Garcia-Romero, Jiatong Shi, Jing Shi, Shinji Watanabe, Kun Wei, Wangyou Zhang, Yuekai Zhang:
Recent Developments on ESPnet Toolkit Boosted by Conformer. ICASSP 2021: 5874-5878
[c7]
https://dblp.org/rec/conf/icassp/InagumaHDK021
Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe:
Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder. ICASSP 2021: 7503-7507
[c6]
https://dblp.org/rec/conf/icassp/HiguchiI0OK21
Yosuke Higuchi, Hirofumi Inaguma, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi:
Improved Mask-CTC for Non-Autoregressive End-to-End ASR. ICASSP 2021: 8363-8367
[c5]
https://dblp.org/rec/conf/interspeech/HiguchiMRH21
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori:
Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition. Interspeech 2021: 726-730
[i11]
https://dblp.org/rec/journals/corr/abs-2106-08922
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori:
Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition. CoRR abs/2106.08922 (2021)
[i10]
https://dblp.org/rec/journals/corr/abs-2109-04411
Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe:
Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring. CoRR abs/2109.04411 (2021)
[i9]
https://dblp.org/rec/journals/corr/abs-2110-04109
Yosuke Higuchi, Keita Karube, Tetsuji Ogawa, Tetsunori Kobayashi:
Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units. CoRR abs/2110.04109 (2021)
[i8]
https://dblp.org/rec/journals/corr/abs-2110-04948
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori:
Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy. CoRR abs/2110.04948 (2021)
[i7]
https://dblp.org/rec/journals/corr/abs-2110-05249
Yosuke Higuchi, Nanxin Chen, Yuya Fujita, Hirofumi Inaguma, Tatsuya Komatsu, Jaesong Lee, Jumon Nozaki, Tianzi Wang, Shinji Watanabe:
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation. CoRR abs/2110.05249 (2021)
[i6]
https://dblp.org/rec/journals/corr/abs-2110-10402
Huaibo Zhao, Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi:
An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASR. CoRR abs/2110.10402 (2021)
2020
[c4]
https://dblp.org/rec/conf/eusipco/HiguchiTOIKO20
Yosuke Higuchi, Naohiro Tawara, Atsunori Ogawa, Tomoharu Iwata, Tetsunori Kobayashi, Tetsuji Ogawa:
Noise-robust Attention Learning for End-to-End Speech Recognition. EUSIPCO 2020: 311-315
[c3]
https://dblp.org/rec/conf/icassp/HiguchiSK20
Yosuke Higuchi, Masayuki Suzuki, Gakuto Kurata:
Speaker Embeddings Incorporating Acoustic Conditions for Diarization. ICASSP 2020: 7129-7133
[c2]
https://dblp.org/rec/conf/interspeech/Higuchi0COK20
Yosuke Higuchi, Shinji Watanabe, Nanxin Chen, Tetsuji Ogawa, Tetsunori Kobayashi:
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict. INTERSPEECH 2020: 3655-3659
[i5]
https://dblp.org/rec/journals/corr/abs-2005-08700
Yosuke Higuchi, Shinji Watanabe, Nanxin Chen, Tetsuji Ogawa, Tetsunori Kobayashi:
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict. CoRR abs/2005.08700 (2020)
[i4]
https://dblp.org/rec/journals/corr/abs-2010-13047
Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe:
Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder. CoRR abs/2010.13047 (2020)
[i3]
https://dblp.org/rec/journals/corr/abs-2010-13270
Yosuke Higuchi, Hirofumi Inaguma, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi:
Improved Mask-CTC for Non-Autoregressive End-to-End ASR. CoRR abs/2010.13270 (2020)
[i2]
https://dblp.org/rec/journals/corr/abs-2010-13956
Pengcheng Guo, Florian Boyer, Xuankai Chang, Tomoki Hayashi, Yosuke Higuchi, Hirofumi Inaguma, Naoyuki Kamo, Chenda Li, Daniel Garcia-Romero, Jiatong Shi, Jing Shi, Shinji Watanabe, Kun Wei, Wangyou Zhang, Yuekai Zhang:
Recent Developments on ESPnet Toolkit Boosted by Conformer. CoRR abs/2010.13956 (2020)
[i1]
https://dblp.org/rec/journals/corr/abs-2012-13006
Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen-Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang:
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans. CoRR abs/2012.13006 (2020)

2010 – 2019

2019
[c1]
https://dblp.org/rec/conf/interspeech/HiguchiTKO19
Yosuke Higuchi, Naohiro Tawara, Tetsunori Kobayashi, Tetsuji Ogawa:
Speaker Adversarial Training of DPGMM-Based Feature Extractor for Zero-Resource Languages. INTERSPEECH 2019: 266-270
