Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ... arXiv preprint arXiv:2303.01037, 2023 | 200 | 2023 |
A long-short term memory recurrent neural network based reinforcement learning controller for office heating ventilation and air conditioning systems Y Wang, K Velswamy, B Huang Processes 5 (3), 46, 2017 | 194 | 2017 |
Improving Speech Recognition Using Consistent Predictions on Synthesized Speech G Wang, A Rosenberg, Z Chen, Y Zhang, B Ramabhadran, Y Wu, ... ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 59 | 2020 |
A Novel Approach to Feedback Control with Deep Reinforcement Learning Y Wang, K Velswamy, B Huang IFAC-PapersOnLine 51 (18), 31-36, 2018 | 57 | 2018 |
Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection. Z Chen, A Rosenberg, Y Zhang, G Wang, B Ramabhadran, PJ Moreno INTERSPEECH, 556-560, 2020 | 42 | 2020 |
Injecting text in self-supervised speech pretraining Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, G Wang, P Moreno 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 36 | 2021 |
Tts4pretrain 2.0: Advancing the use of Text and Speech in ASR Pretraining with Consistency and Contrastive Losses Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, P Moreno, G Wang ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 23 | 2022 |
Modular hybrid autoregressive transducer Z Meng, T Chen, R Prabhavalkar, Y Zhang, G Wang, K Audhkhasi, ... 2022 IEEE Spoken Language Technology Workshop (SLT), 197-204, 2023 | 20 | 2023 |
Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data A Aksënova, Z Chen, CC Chiu, D van Esch, P Golik, W Han, L King, ... arXiv preprint arXiv:2205.08014, 2022 | 19 | 2022 |
Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation. Z Chen, A Rosenberg, Y Zhang, H Zen, M Ghodsi, Y Huang, J Emond, ... Interspeech, 736-740, 2021 | 14 | 2021 |
Virtuoso: Massive multilingual speech-text joint semi-supervised learning for text-to-speech T Saeki, H Zen, Z Chen, N Morioka, G Wang, Y Zhang, A Bapna, ... ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 12 | 2023 |
Understanding Shared Speech-Text Representations G Wang, K Kastner, A Bapna, Z Chen, A Rosenberg, B Ramabhadran, ... ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 11 | 2023 |
SCADA: Stochastic, Consistent and Adversarial Data Augmentation to Improve ASR. G Wang, A Rosenberg, Z Chen, Y Zhang, B Ramabhadran, PJ Moreno INTERSPEECH, 2832-2836, 2020 | 10 | 2020 |
Deep text-to-speech system with seq2seq model G Wang arXiv preprint arXiv:1903.07398, 2019 | 9 | 2019 |
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data T Saeki, G Wang, N Morioka, I Elias, K Kastner, A Rosenberg, ... ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
Using Text Injection to Improve Recognition of Personal Identifiers in Speech Y Blau, R Agrawal, L Madmony, G Wang, A Rosenberg, Z Chen, ... arXiv preprint arXiv:2308.07393, 2023 | 3 | 2023 |
Supervised and Unsupervised Training with Contrastive Loss Over Sequences A Rosenberg, B Ramabhadran, Z Chen, G Wang, Y Zhang, J Emond US Patent App. 17/655,903, 2022 | 2 | 2022 |
Non-Parallel Voice Conversion for ASR Augmentation G Wang, A Rosenberg, B Ramabhadran, F Biadsy, Y Huang, J Emond, ... arXiv preprint arXiv:2209.06987, 2022 | 2 | 2022 |
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR G Wang, ED Cubuk, A Rosenberg, S Cheng, RJ Weiss, B Ramabhadran, ... 2022 IEEE Spoken Language Technology Workshop (SLT), 23-30, 2023 | 1 | 2023 |
Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model HJ Park, D Agarwal, N Chen, R Sun, K Partridge, J Chen, H Zhang, P Zhu, ... arXiv preprint arXiv:2407.18879, 2024 | | 2024 |