Sora: A review on background, technology, limitations, and opportunities of large vision models

Y Liu, K Zhang, Y Li, Z Yan, C Gao, R Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Sora is a text-to-video generative AI model, released by OpenAI in February 2024. The
model is trained to generate videos of realistic or imaginative scenes from text instructions …

A survey of the vision transformers and their CNN-transformer based variants

A Khan, Z Rauf, A Sohail, AR Khan, H Asif… - Artificial Intelligence …, 2023 - Springer
Vision transformers have become popular as a possible substitute for convolutional neural
networks (CNNs) for a variety of computer vision applications. These transformers, with their …

A survey of emerging applications of diffusion probabilistic models in MRI

Y Fan, H Liao, S Huang, Y Luo, H Fu, H Qi - Meta-Radiology, 2024 - Elsevier
Diffusion probabilistic models (DPMs), which employ explicit likelihood characterization and
a gradual sampling process to synthesize data, have gained increasing research interest …

How to Protect Copyright Data in Optimization of Large Language Models?

T Chu, Z Song, C Yang - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
The softmax operator is a crucial component of large language models (LLMs), which have
played a transformative role in computer research. Due to the centrality of the softmax …

Stochastic segmentation with conditional categorical diffusion models

L Zbinden, L Doorenbos, T Pissas… - Proceedings of the …, 2023 - openaccess.thecvf.com
Semantic segmentation has made significant progress in recent years thanks to deep neural
networks, but the common objective of generating a single segmentation output that …

Diffusion model for camouflaged object detection

Z Chen, R Gao, TZ Xiang, F Lin - ECAI 2023, 2023 - ebooks.iospress.nl
Camouflaged object detection is a challenging task that aims to identify objects that are
highly similar to their background. Due to the powerful noise-to-image denoising capability …

DermoSegDiff: A boundary-aware segmentation diffusion model for skin lesion delineation

A Bozorgpour, Y Sadegheih, A Kazerouni… - … Workshop on PRedictive …, 2023 - Springer
Skin lesion segmentation plays a critical role in the early detection and accurate diagnosis of
dermatological conditions. Denoising Diffusion Probabilistic Models (DDPMs) have recently …

TREP: Transformer-based evidential prediction for pedestrian intention with uncertainty

Z Zhang, R Tian, Z Ding - Proceedings of the AAAI Conference on …, 2023 - ojs.aaai.org
With rapid development in hardware (sensors and processors) and AI algorithms, automated
driving techniques have entered the public's daily life and achieved great success in …

Denoising diffusion semantic segmentation with mask prior modeling

Z Lai, Y Duan, J Dai, Z Li, Y Fu, H Li, Y Qiao… - arXiv preprint arXiv …, 2023 - arxiv.org
The evolution of semantic segmentation has long been dominated by learning more
discriminative image representations for classifying each pixel. Despite the prominent …

LLCaps: Learning to illuminate low-light capsule endoscopy with curved wavelet attention and reverse diffusion

L Bai, T Chen, Y Wu, A Wang, M Islam… - … Conference on Medical …, 2023 - Springer
Wireless capsule endoscopy (WCE) is a painless and non-invasive diagnostic tool for
gastrointestinal (GI) diseases. However, due to GI anatomical constraints and hardware …