Fsd50k: an open dataset of human-labeled sound events

E Fonseca, X Favory, J Pons, F Font… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
… In this way we were able to map more than 300,000 … We made the classes of easy and
medium difficulty publicly accessible … We use a model inspired by the original architecture [120] …

Dual t: Reducing estimation error for transition matrix in label-noise learning

Y Yao, T Liu, B Han, M Gong, J Deng… - Advances in neural …, 2020 - proceedings.neurips.cc
… Intuitively, when facing large-scale noisy data, models … on predicting noisy class labels, which
is much easier than learning … we design the intermediate class Y in such a way that P(Y |x) …

Diffsound: Discrete diffusion model for text-to-sound generation

D Yang, J Yu, H Wang, W Wang… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org
… Furthermore, we observe a phenomenon: it is easier to generate audio that only includes
one single event than audio that includes multiple events. To help the Diffsound …

Kalman filter

GF Welch - Computer vision: a reference guide, 2021 - Springer
… true systems, and the noise models rarely exhibit the … such an easy construction of the motion
model or simple computation of … the joint angles in the same way as previously outlined. All …

Image super-resolution via iterative refinement

C Saharia, J Ho, W Chan, T Salimans… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
… In principle, each forward process step can be conditioned on x too, but we we find that a
simple … denoising model fu that takes as input this source image x and a noisy target image ey, …

Simcse: Simple contrastive learning of sentence embeddings

T Gao, X Yao, D Chen - arXiv preprint arXiv:2104.08821, 2021 - arxiv.org
… To further understand the role of dropout noise in unsupervised … We take the checkpoints of
these models every 10 steps during … A simple way to alleviate the problem is postprocessing, …

Diffusion models in vision: A survey

FA Croitoru, V Hondru, RT Ionescu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
… diffusion probabilistic models, noise conditioned score … ] represent an alternative way to
model diffusion, forming the … of generative models that transform a simple Gaussian distribution …

Merlot reserve: Neural script knowledge through vision and language and sound

R Zellers, J Lu, X Lu, Y Yu, Y Zhao… - Proceedings of the …, 2022 - openaccess.thecvf.com
models to learn imperceptible features, like the exact model of the … On EPIC-Kitchens, our
model obtains strong results at … These are easy to learn given access to training data, but we …

Wavegrad: Estimating gradients for waveform generation

N Chen, Y Zhang, H Zen, RJ Weiss, M Norouzi… - arXiv preprint arXiv …, 2020 - arxiv.org
… WaveGrad offers a natural way to trade inference speed for … simple hierarchical sampling
method that mimics the discrete … models condition on a continuous scalar indicating the noise

Real-time gravitational wave science with neural posterior estimation

M Dax, SR Green, J Gair, JH Macke, A Buonanno… - Physical review …, 2021 - APS
… This encodes the signal and noise models within millions of … simulate data, in a noise-model-independent
way, by injecting … This is an easier task than using observational data, which …