Contrastive conditional latent diffusion for audio-visual segmentation

Y Mao, J Zhang, M Xiang, Y Lv, Y Zhong… - arXiv preprint arXiv …, 2023 - arxiv.org
We propose a latent diffusion model with contrastive learning for audio-visual segmentation
(AVS) to extensively explore the contribution of audio. We interpret AVS as a conditional …

Contrastive Conditional Latent Diffusion for Audio-visual Segmentation

Y Mao, J Zhang, M Xiang, Y Lv, Y Zhong… - arXiv e-prints, 2023 - ui.adsabs.harvard.edu
We propose a latent diffusion model with contrastive learning for audio-visual segmentation
(AVS) to extensively explore the contribution of audio. We interpret AVS as a conditional …

[PDF][PDF] Contrastive Conditional Latent Diffusion for Audio-visual Segmentation

Y Mao, J Zhang, M Xiang, Y Lv, Y Zhong, Y Dai - researchgate.net
We propose a latent diffusion model with contrastive learning for audio-visual segmentation
(AVS) to extensively explore the contribution of audio. We interpret AVS as a conditional …