A survey on lora of large language models

Y Mao, Y Ge, Y Fan, W Xu, Y Mi, Z Hu… - Frontiers of Computer …, 2025 - Springer
Abstract Low-Rank Adaptation (LoRA), which updates the dense neural network layers with
pluggable low-rank matrices, is one of the best performed parameter efficient fine-tuning …

Strategic preys make acute predators: Enhancing camouflaged object detectors by generating camouflaged objects

C He, K Li, Y Zhang, Y Zhang, Z Guo, X Li… - arXiv preprint arXiv …, 2023 - arxiv.org
Camouflaged object detection (COD) is the challenging task of identifying camouflaged
objects visually blended into surroundings. Albeit achieving remarkable success, existing …

Gra: Detecting oriented objects through group-wise rotating and attention

J Wang, Y Pu, Y Han, J Guo, Y Wang, X Li… - European Conference on …, 2025 - Springer
Oriented object detection, an emerging task in recent years, aims to identify and locate
objects across varied orientations. This requires the detector to accurately capture the …

Efficient diffusion transformer with step-wise dynamic attention mediators

Y Pu, Z Xia, J Guo, D Han, Q Li, D Li, Y Yuan… - … on Computer Vision, 2025 - Springer
This paper identifies significant redundancy in the query-key interactions within self-attention
mechanisms of diffusion transformer models, particularly during the early stages of …

Adanat: Exploring adaptive policy for token-based image generation

Z Ni, Y Wang, R Zhou, R Lu, J Guo, J Hu, Z Liu… - … on Computer Vision, 2025 - Springer
Recent studies have demonstrated the effectiveness of token-based methods for visual
content generation. As a representative work, non-autoregressive Transformers (NATs) are …

Diffusion models and representation learning: A survey

M Fuest, P Ma, M Gui, JS Fischer, VT Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion Models are popular generative modeling methods in various vision tasks, attracting
significant attention. They can be considered a unique instance of self-supervised learning …

Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment

J Guo, J Zhao, C Ge, C Du, Z Ni, S Song, H Shi… - arXiv preprint arXiv …, 2024 - arxiv.org
Test-time adaptation (TTA) aims to enhance the performance of source-domain pretrained
models when tested on unknown shifted target domains. Traditional TTA methods primarily …

Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices

Z Ma, Y Zhang, G Jia, L Zhao, Y Ma, M Ma… - arXiv preprint arXiv …, 2024 - arxiv.org
As one of the most popular and sought-after generative models in the recent years, diffusion
models have sparked the interests of many researchers and steadily shown excellent …

Taming Rectified Flow for Inversion and Editing

J Wang, J Pu, Z Qi, J Guo, Y Ma, N Huang… - arXiv preprint arXiv …, 2024 - arxiv.org
Rectified-flow-based diffusion transformers, such as FLUX and OpenSora, have
demonstrated exceptional performance in the field of image and video generation. Despite …

COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing

J Wang, Y Ma, J Guo, Y Xiao, G Huang, X Li - arXiv preprint arXiv …, 2024 - arxiv.org
Video editing is an emerging task, in which most current methods adopt the pre-trained text-
to-image (T2I) diffusion model to edit the source video in a zero-shot manner. Despite …