Sora: A review on background, technology, limitations, and opportunities of large vision models

Y Liu, K Zhang, Y Li, Z Yan, C Gao, R Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Sora is a text-to-video generative AI model, released by OpenAI in February 2024. The
model is trained to generate videos of realistic or imaginative scenes from text instructions …

A survey of the vision transformers and their CNN-transformer based variants

A Khan, Z Rauf, A Sohail, AR Khan, H Asif… - Artificial Intelligence …, 2023 - Springer
Vision transformers have become popular as a possible substitute to convolutional neural
networks (CNNs) for a variety of computer vision applications. These transformers, with their …

[HTML][HTML] A survey of emerging applications of diffusion probabilistic models in mri

Y Fan, H Liao, S Huang, Y Luo, H Fu, H Qi - Meta-Radiology, 2024 - Elsevier
Diffusion probabilistic models (DPMs) which employ explicit likelihood characterization and
a gradual sampling process to synthesize data, have gained increasing research interest …

How to Protect Copyright Data in Optimization of Large Language Models?

T Chu, Z Song, C Yang - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
The softmax operator is a crucial component of large language models (LLMs), which have
played a transformative role in computer research. Due to the centrality of the softmax …

Medical sam 2: Segment medical images as video via segment anything model 2

J Zhu, Y Qi, J Wu - arXiv preprint arXiv:2408.00874, 2024 - arxiv.org
Medical image segmentation plays a pivotal role in clinical diagnostics and treatment
planning, yet existing models often face challenges in generalization and in handling both …

Stochastic segmentation with conditional categorical diffusion models

L Zbinden, L Doorenbos, T Pissas… - Proceedings of the …, 2023 - openaccess.thecvf.com
Semantic segmentation has made significant progress in recent years thanks to deep neural
networks, but the common objective of generating a single segmentation output that …

Diffusion model for camouflaged object detection

Z Chen, R Gao, TZ Xiang, F Lin - ECAI 2023, 2023 - ebooks.iospress.nl
Camouflaged object detection is a challenging task that aims to identify objects that are
highly similar to their background. Due to the powerful noise-to-image denoising capability …

Dermosegdiff: A boundary-aware segmentation diffusion model for skin lesion delineation

A Bozorgpour, Y Sadegheih, A Kazerouni… - … Workshop on PRedictive …, 2023 - Springer
Skin lesion segmentation plays a critical role in the early detection and accurate diagnosis of
dermatological conditions. Denoising Diffusion Probabilistic Models (DDPMs) have recently …

Trep: Transformer-based evidential prediction for pedestrian intention with uncertainty

Z Zhang, R Tian, Z Ding - Proceedings of the AAAI Conference on …, 2023 - ojs.aaai.org
With rapid development in hardware (sensors and processors) and AI algorithms, automated
driving techniques have entered the public's daily life and achieved great success in …

Denoising diffusion semantic segmentation with mask prior modeling

Z Lai, Y Duan, J Dai, Z Li, Y Fu, H Li, Y Qiao… - arXiv preprint arXiv …, 2023 - arxiv.org
The evolution of semantic segmentation has long been dominated by learning more
discriminative image representations for classifying each pixel. Despite the prominent …