Noise estimation for generative diffusion models R San-Roman, E Nachmani, L Wolf arXiv preprint arXiv:2104.02600, 2021 | 95 | 2021 |
Seamless: Multilingual Expressive and Streaming Speech Translation L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, M Duppenthaler, ... arXiv preprint arXiv:2312.05187, 2023 | 55 | 2023 |
Non Gaussian Denoising Diffusion Models E Nachmani, RS Roman, L Wolf arXiv preprint arXiv:2106.07582, 2021 | 55 | 2021 |
Denoising diffusion gamma models E Nachmani, RS Roman, L Wolf arXiv preprint arXiv:2110.05948, 2021 | 21 | 2021 |
From discrete tokens to high-fidelity audio using multi-band diffusion R San Roman, Y Adi, A Deleforge, R Serizel, G Synnaeve, A Défossez Advances in Neural Information Processing Systems 36, 2024 | 10 | 2024 |
Proactive detection of voice cloning with localized watermarking R San Roman, P Fernandez, H Elsahar, A Défossez, T Furon, T Tran International Conference on Machine Learning 235, 2024 | 9* | 2024 |