Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding C Saharia, W Chan, S Saxena, L Li, J Whang, E Denton, ... NeurIPS, 2022 | 3842 | 2022 |
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition DS Park, W Chan, Y Zhang, CC Chiu, B Zoph, ED Cubuk, QV Le INTERSPEECH, 2019 | 3840 | 2019 |
Listen, Attend and Spell: A Neural Network for Large Vocabulary Conversational Speech Recognition W Chan, N Jaitly, QV Le, O Vinyals ICASSP, 2016 | 3207* | 2016 |
Image Super-Resolution via Iterative Refinement C Saharia, J Ho, W Chan, T Salimans, D Fleet, M Norouzi IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022 | 1238 | 2022 |
Palette: Image-to-Image Diffusion Models C Saharia, W Chan, H Chang, C A. Lee, J Ho, D Tim Salimans, J. Fleet, ... SIGGRAPH, 2022 | 998 | 2022 |
Imagen Video: High Definition Video Generation with Diffusion Models J Ho, W Chan, C Saharia, J Whang, R Gao, A Gritsenko, D P. Kingma, ... arXiv:2210.02303, 2022 | 882 | 2022 |
Video Diffusion Models J Ho, T Salimans, A Gritsenko, W Chan, M Norouzi, D Fleet arXiv:2204.03458, 2022 | 874 | 2022 |
Cascaded Diffusion Models for High Fidelity Image Generation J Ho, C Saharia, W Chan, D Fleet, M Norouzi, T Salimans Journal of Machine Learning Research 23 (47), 1-33, 2022 | 833 | 2022 |
WaveGrad: Estimating Gradients for Waveform Generation N Chen, Y Zhang, H Zen, R Weiss, M Norouzi, W Chan ICLR, 2021 | 664 | 2021 |
Very Deep Convolutional Networks for End-to-End Speech Recognition Y Zhang, W Chan, N Jaitly ICASSP, 2017 | 552 | 2017 |
Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM T Hori, S Watanabe, Y Zhang, W Chan INTERSPEECH, 2017 | 349 | 2017 |
Insertion Transformer: Flexible Sequence Generation via Insertion Operations M Stern, W Chan, J Kiros, J Uszkoreit ICML, 2019 | 248 | 2019 |
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 202 | 2019 |
Novel View Synthesis with Diffusion Models D Watson, W Chan, R Martin-Brualla, J Ho, A Tagliasacchi, M Norouzi ICLR, 2023 | 186 | 2023 |
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ... IEEE Journal of Selected Topics in Signal Processing, 2021 | 159 | 2021 |
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes B Li, Y Zhang, T Sainath, Y Wu, W Chan ICASSP, 2019 | 151 | 2019 |
SpecAugment on Large Scale Datasets D Park, Y Zhang, CC Chiu, Y Chen, B Li, W Chan, Q Le, Y Wu ICASSP, 2020 | 149 | 2020 |
Predicting Collective Sentiment Dynamics from Time-series Social Media L Nguyen, P Wu, W Chan, W Peng, Y Zhang SIGKDD WISDOM, 2012 | 143 | 2012 |
Non-Autoregressive Machine Translation with Latent Alignments C Saharia, W Chan, S Saxena, Norouzi, Mohammad EMNLP, 2020 | 142 | 2020 |
Learning Fast Samplers for Diffusion Models by Differentiating Through Sample Quality D Watson, W Chan, J Ho, M Norouzi ICLR, 2022 | 138 | 2022 |