Contrastive conditional latent diffusion for audio-visual segmentation
We propose a latent diffusion model with contrastive learning for audio-visual segmentation
(AVS) to extensively explore the contribution of audio. We interpret AVS as a conditional …
(AVS) to extensively explore the contribution of audio. We interpret AVS as a conditional …
Med-cDiff: Conditional medical image generation with diffusion models
Conditional image generation plays a vital role in medical image analysis as it is effective in
tasks such as super-resolution, denoising, and inpainting, among others. Diffusion models …
tasks such as super-resolution, denoising, and inpainting, among others. Diffusion models …
DiffLoc: Diffusion Model for Outdoor LiDAR Localization
Absolute pose regression (APR) estimates global pose in an end-to-end manner achieving
impressive results in learn-based LiDAR localization. However compared to the top …
impressive results in learn-based LiDAR localization. However compared to the top …
Tyche: Stochastic In-Context Learning for Medical Image Segmentation
Existing learning-based solutions to medical image segmentation have two important
shortcomings. First for most new segmentation tasks a new model has to be trained or fine …
shortcomings. First for most new segmentation tasks a new model has to be trained or fine …
A Recycling Training Strategy for Medical Image Segmentation with Diffusion Denoising Models
Denoising diffusion models have found applications in image segmentation by generating
segmented masks conditioned on images. Existing studies predominantly focus on adjusting …
segmented masks conditioned on images. Existing studies predominantly focus on adjusting …
Diffusion-based Blind Text Image Super-Resolution
Recovering degraded low-resolution text images is challenging especially for Chinese text
images with complex strokes and severe degradation in real-world scenarios. Ensuring both …
images with complex strokes and severe degradation in real-world scenarios. Ensuring both …
Sequential Amodal Segmentation via Cumulative Occlusion Learning
To fully understand the 3D context of a single image, a visual system must be able to
segment both the visible and occluded regions of objects, while discerning their occlusion …
segment both the visible and occluded regions of objects, while discerning their occlusion …
Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer
Y Ge, J Liu, Q Fan, X Jiang, Y Huang, S Qin… - arXiv preprint arXiv …, 2024 - arxiv.org
In this work, we target the task of text-driven style transfer in the context of text-to-image (T2I)
diffusion models. The main challenge is consistent structure preservation while enabling …
diffusion models. The main challenge is consistent structure preservation while enabling …
Probabilistic brain extraction in MR images via conditional generative adversarial networks
Brain extraction, or the task of segmenting the brain in MR images, forms an essential step
for many neuroimaging applications. These include quantifying brain tissue volumes …
for many neuroimaging applications. These include quantifying brain tissue volumes …
Investigating and Improving Latent Density Segmentation Models for Aleatoric Uncertainty Quantification in Medical Imaging
Data uncertainties, such as sensor noise or occlusions, can introduce irreducible
ambiguities in images, which result in varying, yet plausible, semantic hypotheses. In …
ambiguities in images, which result in varying, yet plausible, semantic hypotheses. In …