Gary Wang
Verified email at google.com - Homepage
Title
Cited by
Year
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ...
arXiv preprint arXiv:2303.01037, 2023
Cited by: 200; Year: 2023
A long-short term memory recurrent neural network based reinforcement learning controller for office heating ventilation and air conditioning systems
Y Wang, K Velswamy, B Huang
Processes 5 (3), 46, 2017
Cited by: 194; Year: 2017
Improving Speech Recognition Using Consistent Predictions on Synthesized Speech
G Wang, A Rosenberg, Z Chen, Y Zhang, B Ramabhadran, Y Wu, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 59; Year: 2020
A Novel Approach to Feedback Control with Deep Reinforcement Learning
Y Wang, K Velswamy, B Huang
IFAC-PapersOnLine 51 (18), 31-36, 2018
Cited by: 57; Year: 2018
Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection.
Z Chen, A Rosenberg, Y Zhang, G Wang, B Ramabhadran, PJ Moreno
INTERSPEECH, 556-560, 2020
Cited by: 42; Year: 2020
Injecting text in self-supervised speech pretraining
Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, G Wang, P Moreno
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Cited by: 36; Year: 2021
Tts4pretrain 2.0: Advancing the use of Text and Speech in ASR Pretraining with Consistency and Contrastive Losses
Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, P Moreno, G Wang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 23; Year: 2022
Modular hybrid autoregressive transducer
Z Meng, T Chen, R Prabhavalkar, Y Zhang, G Wang, K Audhkhasi, ...
2022 IEEE Spoken Language Technology Workshop (SLT), 197-204, 2023
Cited by: 20; Year: 2023
Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data
A Aksënova, Z Chen, CC Chiu, D van Esch, P Golik, W Han, L King, ...
arXiv preprint arXiv:2205.08014, 2022
Cited by: 19; Year: 2022
Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation.
Z Chen, A Rosenberg, Y Zhang, H Zen, M Ghodsi, Y Huang, J Emond, ...
Interspeech, 736-740, 2021
Cited by: 14; Year: 2021
Virtuoso: Massive multilingual speech-text joint semi-supervised learning for text-to-speech
T Saeki, H Zen, Z Chen, N Morioka, G Wang, Y Zhang, A Bapna, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 12; Year: 2023
Understanding Shared Speech-Text Representations
G Wang, K Kastner, A Bapna, Z Chen, A Rosenberg, B Ramabhadran, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 11; Year: 2023
SCADA: Stochastic, Consistent and Adversarial Data Augmentation to Improve ASR.
G Wang, A Rosenberg, Z Chen, Y Zhang, B Ramabhadran, PJ Moreno
INTERSPEECH, 2832-2836, 2020
Cited by: 10; Year: 2020
Deep text-to-speech system with seq2seq model
G Wang
arXiv preprint arXiv:1903.07398, 2019
Cited by: 9; Year: 2019
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data
T Saeki, G Wang, N Morioka, I Elias, K Kastner, A Rosenberg, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 4; Year: 2024
Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Y Blau, R Agrawal, L Madmony, G Wang, A Rosenberg, Z Chen, ...
arXiv preprint arXiv:2308.07393, 2023
Cited by: 3; Year: 2023
Supervised and Unsupervised Training with Contrastive Loss Over Sequences
A Rosenberg, B Ramabhadran, Z Chen, G Wang, Y Zhang, J Emond
US Patent App. 17/655,903, 2022
Cited by: 2; Year: 2022
Non-Parallel Voice Conversion for ASR Augmentation
G Wang, A Rosenberg, B Ramabhadran, F Biadsy, Y Huang, J Emond, ...
arXiv preprint arXiv:2209.06987, 2022
Cited by: 2; Year: 2022
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR
G Wang, ED Cubuk, A Rosenberg, S Cheng, RJ Weiss, B Ramabhadran, ...
2022 IEEE Spoken Language Technology Workshop (SLT), 23-30, 2023
Cited by: 1; Year: 2023
Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model
HJ Park, D Agarwal, N Chen, R Sun, K Partridge, J Chen, H Zhang, P Zhu, ...
arXiv preprint arXiv:2407.18879, 2024
Year: 2024