Contrastive conditional latent diffusion for audio-visual segmentation

Y Mao, J Zhang, M Xiang, Y Lv, Y Zhong… - arXiv preprint arXiv …, 2023 - arxiv.org
We propose a latent diffusion model with contrastive learning for audio-visual segmentation
(AVS) to extensively explore the contribution of audio. We interpret AVS as a conditional …

Med-cDiff: Conditional medical image generation with diffusion models

ALY Hung, K Zhao, H Zheng, R Yan, SS Raman… - Bioengineering, 2023 - mdpi.com
Conditional image generation plays a vital role in medical image analysis as it is effective in
tasks such as super-resolution, denoising, and inpainting, among others. Diffusion models …

DiffLoc: Diffusion Model for Outdoor LiDAR Localization

W Li, Y Yang, S Yu, G Hu, C Wen… - Proceedings of the …, 2024 - openaccess.thecvf.com
Absolute pose regression (APR) estimates global pose in an end-to-end manner achieving
impressive results in learn-based LiDAR localization. However compared to the top …

Tyche: Stochastic In-Context Learning for Medical Image Segmentation

M Rakic, HE Wong, JJG Ortiz… - Proceedings of the …, 2024 - openaccess.thecvf.com
Existing learning-based solutions to medical image segmentation have two important
shortcomings. First for most new segmentation tasks a new model has to be trained or fine …

A Recycling Training Strategy for Medical Image Segmentation with Diffusion Denoising Models

Y Fu, Y Li, SU Saeed, MJ Clarkson, Y Hu - arXiv preprint arXiv:2308.16355, 2023 - arxiv.org
Denoising diffusion models have found applications in image segmentation by generating
segmented masks conditioned on images. Existing studies predominantly focus on adjusting …

Diffusion-based Blind Text Image Super-Resolution

Y Zhang, J Zhang, H Li, Z Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recovering degraded low-resolution text images is challenging especially for Chinese text
images with complex strokes and severe degradation in real-world scenarios. Ensuring both …

Sequential Amodal Segmentation via Cumulative Occlusion Learning

J Ao, Q Ke, KA Ehinger - arXiv preprint arXiv:2405.05791, 2024 - arxiv.org
To fully understand the 3D context of a single image, a visual system must be able to
segment both the visible and occluded regions of objects, while discerning their occlusion …

Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer

Y Ge, J Liu, Q Fan, X Jiang, Y Huang, S Qin… - arXiv preprint arXiv …, 2024 - arxiv.org
In this work, we target the task of text-driven style transfer in the context of text-to-image (T2I)
diffusion models. The main challenge is consistent structure preservation while enabling …

Probabilistic brain extraction in MR images via conditional generative adversarial networks

S Moazami, D Ray, D Pelletier… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Brain extraction, or the task of segmenting the brain in MR images, forms an essential step
for many neuroimaging applications. These include quantifying brain tissue volumes …

Investigating and Improving Latent Density Segmentation Models for Aleatoric Uncertainty Quantification in Medical Imaging

MM Valiuddin, CGA Viviers, RJG van Sloun… - arXiv preprint arXiv …, 2023 - arxiv.org
Data uncertainties, such as sensor noise or occlusions, can introduce irreducible
ambiguities in images, which result in varying, yet plausible, semantic hypotheses. In …