Sora: A review on background, technology, limitations, and opportunities of large vision models
Sora is a text-to-video generative AI model, released by OpenAI in February 2024. The
model is trained to generate videos of realistic or imaginative scenes from text instructions …
model is trained to generate videos of realistic or imaginative scenes from text instructions …
A survey of the vision transformers and their CNN-transformer based variants
Vision transformers have become popular as a possible substitute to convolutional neural
networks (CNNs) for a variety of computer vision applications. These transformers, with their …
networks (CNNs) for a variety of computer vision applications. These transformers, with their …
[HTML][HTML] A survey of emerging applications of diffusion probabilistic models in mri
Diffusion probabilistic models (DPMs) which employ explicit likelihood characterization and
a gradual sampling process to synthesize data, have gained increasing research interest …
a gradual sampling process to synthesize data, have gained increasing research interest …
How to Protect Copyright Data in Optimization of Large Language Models?
The softmax operator is a crucial component of large language models (LLMs), which have
played a transformative role in computer research. Due to the centrality of the softmax …
played a transformative role in computer research. Due to the centrality of the softmax …
Medical sam 2: Segment medical images as video via segment anything model 2
J Zhu, Y Qi, J Wu - arXiv preprint arXiv:2408.00874, 2024 - arxiv.org
Medical image segmentation plays a pivotal role in clinical diagnostics and treatment
planning, yet existing models often face challenges in generalization and in handling both …
planning, yet existing models often face challenges in generalization and in handling both …
Stochastic segmentation with conditional categorical diffusion models
Semantic segmentation has made significant progress in recent years thanks to deep neural
networks, but the common objective of generating a single segmentation output that …
networks, but the common objective of generating a single segmentation output that …
Diffusion model for camouflaged object detection
Camouflaged object detection is a challenging task that aims to identify objects that are
highly similar to their background. Due to the powerful noise-to-image denoising capability …
highly similar to their background. Due to the powerful noise-to-image denoising capability …
Dermosegdiff: A boundary-aware segmentation diffusion model for skin lesion delineation
A Bozorgpour, Y Sadegheih, A Kazerouni… - … Workshop on PRedictive …, 2023 - Springer
Skin lesion segmentation plays a critical role in the early detection and accurate diagnosis of
dermatological conditions. Denoising Diffusion Probabilistic Models (DDPMs) have recently …
dermatological conditions. Denoising Diffusion Probabilistic Models (DDPMs) have recently …
Trep: Transformer-based evidential prediction for pedestrian intention with uncertainty
With rapid development in hardware (sensors and processors) and AI algorithms, automated
driving techniques have entered the public's daily life and achieved great success in …
driving techniques have entered the public's daily life and achieved great success in …
Denoising diffusion semantic segmentation with mask prior modeling
The evolution of semantic segmentation has long been dominated by learning more
discriminative image representations for classifying each pixel. Despite the prominent …
discriminative image representations for classifying each pixel. Despite the prominent …