Mm-llms: Recent advances in multimodal large language models
In the past year, MultiModal Large Language Models (MM-LLMs) have undergone
substantial advancements, augmenting off-the-shelf LLMs to support MM inputs or outputs …
substantial advancements, augmenting off-the-shelf LLMs to support MM inputs or outputs …
Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models
In this paper, we study the harmlessness alignment problem of multimodal large language
models~(MLLMs). We conduct a systematic empirical analysis of the harmlessness …
models~(MLLMs). We conduct a systematic empirical analysis of the harmlessness …
Adashield: Safeguarding multimodal large language models from structure-based attack via adaptive shield prompting
With the advent and widespread deployment of Multimodal Large Language Models
(MLLMs), the imperative to ensure their safety has become increasingly pronounced …
(MLLMs), the imperative to ensure their safety has become increasingly pronounced …
Eyes closed, safety on: Protecting multimodal llms via image-to-text transformation
Multimodal large language models (MLLMs) have shown impressive reasoning abilities,
which, however, are also more vulnerable to jailbreak attacks than their LLM predecessors …
which, however, are also more vulnerable to jailbreak attacks than their LLM predecessors …
Safety of Multimodal Large Language Models on Images and Text
Attracted by the impressive power of Multimodal Large Language Models (MLLMs), the
public is increasingly utilizing them to improve the efficiency of daily work. Nonetheless, the …
public is increasingly utilizing them to improve the efficiency of daily work. Nonetheless, the …
A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
D Liu, M Yang, X Qu, P Zhou, W Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
With the significant development of large models in recent years, Large Vision-Language
Models (LVLMs) have demonstrated remarkable capabilities across a wide range of …
Models (LVLMs) have demonstrated remarkable capabilities across a wide range of …
ImgTrojan: Jailbreaking Vision-Language Models with ONE Image
There has been an increasing interest in the alignment of large language models (LLMs)
with human values. However, the safety issues of their integration with a vision module, or …
with human values. However, the safety issues of their integration with a vision module, or …
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
The emergence of Vision Language Models (VLMs) has brought unprecedented advances
in understanding multimodal information. The combination of textual and visual semantics in …
in understanding multimodal information. The combination of textual and visual semantics in …
A Survey on Safe Multi-Modal Learning System
With the wide deployment of multimodal learning systems (MMLS) in real-world scenarios,
safety concerns have become increasingly prominent. The absence of systematic research …
safety concerns have become increasingly prominent. The absence of systematic research …
Unbridled Icarus: A Survey of the Potential Perils of Image Inputs in Multimodal Large Language Model Security
Multimodal Large Language Models (MLLMs) demonstrate remarkable capabilities that
increasingly influence various aspects of our daily lives, constantly defining the new …
increasingly influence various aspects of our daily lives, constantly defining the new …