A survey of attacks on large vision-language models: Resources, advances, and future trends
With the significant development of large models in recent years, Large Vision-Language
Models (LVLMs) have demonstrated remarkable capabilities across a wide range of …
Models (LVLMs) have demonstrated remarkable capabilities across a wide range of …
Eyes closed, safety on: Protecting multimodal llms via image-to-text transformation
Multimodal large language models (MLLMs) have shown impressive reasoning abilities.
However, they are also more vulnerable to jailbreak attacks than their LLM predecessors …
However, they are also more vulnerable to jailbreak attacks than their LLM predecessors …
Privacy in large language models: Attacks, defenses and future directions
The advancement of large language models (LLMs) has significantly enhanced the ability to
effectively tackle various downstream NLP tasks and unify these tasks into generative …
effectively tackle various downstream NLP tasks and unify these tasks into generative …
Controllable text generation for large language models: A survey
In Natural Language Processing (NLP), Large Language Models (LLMs) have demonstrated
high text generation quality. However, in real-world applications, LLMs must meet …
high text generation quality. However, in real-world applications, LLMs must meet …
Cross-modality safety alignment
As Artificial General Intelligence (AGI) becomes increasingly integrated into various facets of
human life, ensuring the safety and ethical alignment of such systems is paramount …
human life, ensuring the safety and ethical alignment of such systems is paramount …
Locking down the finetuned llms safety
Fine-tuning large language models (LLMs) on additional datasets is often necessary to
optimize them for specific downstream tasks. However, existing safety alignment measures …
optimize them for specific downstream tasks. However, existing safety alignment measures …
Bathe: Defense against the jailbreak attack in multimodal large language models by treating harmful instruction as backdoor trigger
Multimodal Large Language Models (MLLMs) have showcased impressive performance in a
variety of multimodal tasks. On the other hand, the integration of additional image modality …
variety of multimodal tasks. On the other hand, the integration of additional image modality …
Towards tracing trustworthiness dynamics: Revisiting pre-training period of large language models
Ensuring the trustworthiness of large language models (LLMs) is crucial. Most studies
concentrate on fully pre-trained LLMs to better understand and improve LLMs' …
concentrate on fully pre-trained LLMs to better understand and improve LLMs' …
Safealigner: Safety alignment against jailbreak attacks via response disparity guidance
As the development of large language models (LLMs) rapidly advances, securing these
models effectively without compromising their utility has become a pivotal area of research …
models effectively without compromising their utility has become a pivotal area of research …
Safety of Multimodal Large Language Models on Images and Text
Attracted by the impressive power of Multimodal Large Language Models (MLLMs), the
public is increasingly utilizing them to improve the efficiency of daily work. Nonetheless, the …
public is increasingly utilizing them to improve the efficiency of daily work. Nonetheless, the …