Vitaly Lavrukhin

2020 – today

2024
[c13]
Yang Zhang, Travis M. Bartley, Mariana Graterol-Fuenmayor, Vitaly Lavrukhin, Evelina Bakhturina, Boris Ginsburg:
A Chat about Boring Problems: Studying GPT-Based Text Normalization. ICASSP 2024: 10921-10925
[i18]
Vladimir Bataev, Hainan Xu, Daniel Galvez, Vitaly Lavrukhin, Boris Ginsburg:
Label-Looping: Highly Efficient Decoding for Transducers. CoRR abs/2406.06220 (2024)
[i17]
Andrei Andrusenko, Aleksandr Laptev, Vladimir Bataev, Vitaly Lavrukhin, Boris Ginsburg:
Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter. CoRR abs/2406.07096 (2024)
[i16]
Krishna C. Puvvada, Piotr Zelasko, He Huang, Oleksii Hrinchuk, Nithin Rao Koluguri, Kunal Dhawan, Somshubra Majumdar, Elena Rastorgueva, Zhehuai Chen, Vitaly Lavrukhin, Jagadeesh Balam, Boris Ginsburg:
Less is More: Accurate Speech Recognition & Translation without Web-Scale Data. CoRR abs/2406.19674 (2024)
[i15]
Ke Hu, Zhehuai Chen, Chao-Han Huck Yang, Piotr Zelasko, Oleksii Hrinchuk, Vitaly Lavrukhin, Jagadeesh Balam, Boris Ginsburg:
Chain-of-Thought Prompting for Speech Translation. CoRR abs/2409.11538 (2024)
[i14]
Piotr Zelasko, Zhehuai Chen, Mengru Wang, Daniel Galvez, Oleksii Hrinchuk, Shuoyang Ding, Ke Hu, Jagadeesh Balam, Vitaly Lavrukhin, Boris Ginsburg:
EMMeTT: Efficient Multimodal Machine Translation Training. CoRR abs/2409.13523 (2024)
2023
[c12]
Aleksandr Meister, Matvei Novikov, Nikolay Karpov, Evelina Bakhturina, Vitaly Lavrukhin, Boris Ginsburg:
LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of End-to-End ASR Models. ASRU 2023: 1-7
[c11]
Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg:
Conformer-Based Target-Speaker Automatic Speech Recognition For Single-Channel Audio. ICASSP 2023: 1-5
[c10]
Igor Gitman, Vitaly Lavrukhin, Aleksandr Laptev, Boris Ginsburg:
Confidence-based Ensembles of End-to-End Speech Recognition Models. INTERSPEECH 2023: 1414-1418
[c9]
Vladimir Bataev, Roman Korostik, Evgeny Shabalin, Vitaly Lavrukhin, Boris Ginsburg:
Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator. INTERSPEECH 2023: 2928-2932
[c8]
Elena Rastorgueva, Vitaly Lavrukhin, Boris Ginsburg:
NeMo Forced Aligner and its application to word alignment for subtitle generation. INTERSPEECH 2023: 5257-5258
[i13]
Vladimir Bataev, Roman Korostik, Evgeny Shabalin, Vitaly Lavrukhin, Boris Ginsburg:
Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator. CoRR abs/2302.14036 (2023)
[i12]
Igor Gitman, Vitaly Lavrukhin, Aleksandr Laptev, Boris Ginsburg:
Confidence-based Ensembles of End-to-End Speech Recognition Models. CoRR abs/2306.15824 (2023)
[i11]
Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg:
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio. CoRR abs/2308.05218 (2023)
[i10]
Yang Zhang, Travis M. Bartley, Mariana Graterol-Fuenmayor, Vitaly Lavrukhin, Evelina Bakhturina, Boris Ginsburg:
A Chat About Boring Problems: Studying GPT-based text normalization. CoRR abs/2309.13426 (2023)
[i9]
Aleksandr Meister, Matvei Novikov, Nikolay Karpov, Evelina Bakhturina, Vitaly Lavrukhin, Boris Ginsburg:
LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of end-to-end ASR Models. CoRR abs/2310.02943 (2023)
2022
[c7]
Somshubra Majumdar, Shantanu Acharya, Vitaly Lavrukhin, Boris Ginsburg:
Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition. SLT 2022: 130-135
[i8]
Somshubra Majumdar, Shantanu Acharya, Vitaly Lavrukhin, Boris Ginsburg:
Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition. CoRR abs/2210.03255 (2022)
2021
[c6]
Jian Luo, Jianzong Wang, Ning Cheng, Edward Xiao, Jing Xiao, Georg Kucsko, Patrick K. O'Neill, Jagadeesh Balam, Slyne Deng, Adriana Flores, Boris Ginsburg, Jocelyn Huang, Oleksii Kuchaiev, Vitaly Lavrukhin, Jason Li:
Cross-Language Transfer Learning and Domain Adaptation for End-to-End Automatic Speech Recognition. ICME 2021: 1-6
[c5]
Patrick K. O'Neill, Vitaly Lavrukhin, Somshubra Majumdar, Vahid Noroozi, Yuekai Zhang, Oleksii Kuchaiev, Jagadeesh Balam, Yuliya Dovzhenko, Keenan Freyberg, Michael D. Shulman, Boris Ginsburg, Shinji Watanabe, Georg Kucsko:
SPGISpeech: 5,000 Hours of Transcribed Financial Audio for Fully Formatted End-to-End Speech Recognition. Interspeech 2021: 1434-1438
[c4]
Evelina Bakhturina, Vitaly Lavrukhin, Boris Ginsburg, Yang Zhang:
Hi-Fi Multi-Speaker English TTS Dataset. Interspeech 2021: 2776-2780
[c3]
Evelina Bakhturina, Vitaly Lavrukhin, Boris Ginsburg:
A Toolbox for Construction and Analysis of Speech Datasets. NeurIPS Datasets and Benchmarks 2021
[i7]
Patrick K. O'Neill, Vitaly Lavrukhin, Somshubra Majumdar, Vahid Noroozi, Yuekai Zhang, Oleksii Kuchaiev, Jagadeesh Balam, Yuliya Dovzhenko, Keenan Freyberg, Michael D. Shulman, Boris Ginsburg, Shinji Watanabe, Georg Kucsko:
SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition. CoRR abs/2104.02014 (2021)
[i6]
Evelina Bakhturina, Vitaly Lavrukhin, Boris Ginsburg:
NeMo Toolbox for Speech Dataset Construction. CoRR abs/2104.04896 (2021)
2020
[c2]
Samuel Kriman, Stanislav Beliaev, Boris Ginsburg, Jocelyn Huang, Oleksii Kuchaiev, Vitaly Lavrukhin, Ryan Leary, Jason Li, Yang Zhang:
QuartzNet: Deep Automatic Speech Recognition with 1D Time-Channel Separable Convolutions. ICASSP 2020: 6124-6128

2010 – 2019

2019
[c1]
Jason Li, Vitaly Lavrukhin, Boris Ginsburg, Ryan Leary, Oleksii Kuchaiev, Jonathan M. Cohen, Huyen Nguyen, Ravi Teja Gadde:
Jasper: An End-to-End Convolutional Neural Acoustic Model. INTERSPEECH 2019: 71-75
[i5]
Jason Li, Vitaly Lavrukhin, Boris Ginsburg, Ryan Leary, Oleksii Kuchaiev, Jonathan M. Cohen, Huyen Nguyen, Ravi Teja Gadde:
Jasper: An End-to-End Convolutional Neural Acoustic Model. CoRR abs/1904.03288 (2019)
[i4]
Boris Ginsburg, Patrice Castonguay, Oleksii Hrinchuk, Oleksii Kuchaiev, Vitaly Lavrukhin, Ryan Leary, Jason Li, Huyen Nguyen, Jonathan M. Cohen:
Stochastic Gradient Methods with Layer-wise Adaptive Moments for Training of Deep Networks. CoRR abs/1905.11286 (2019)
[i3]
Oleksii Kuchaiev, Jason Li, Huyen Nguyen, Oleksii Hrinchuk, Ryan Leary, Boris Ginsburg, Samuel Kriman, Stanislav Beliaev, Vitaly Lavrukhin, Jack Cook, Patrice Castonguay, Mariya Popova, Jocelyn Huang, Jonathan M. Cohen:
NeMo: a toolkit for building AI applications using Neural Modules. CoRR abs/1909.09577 (2019)
2018
[i2]
Oleksii Kuchaiev, Boris Ginsburg, Igor Gitman, Vitaly Lavrukhin, Carl Case, Paulius Micikevicius:
OpenSeq2Seq: extensible toolkit for distributed and mixed precision training of sequence-to-sequence models. CoRR abs/1805.10387 (2018)
[i1]
Jason Li, Ravi Gadde, Boris Ginsburg, Vitaly Lavrukhin:
Training Neural Speech Recognition Systems with Synthetic Speech Augmentation. CoRR abs/1811.00707 (2018)
