Benchmarking large language models on cmexam-a comprehensive chinese medical exam dataset
Recent advancements in large language models (LLMs) have transformed the field of
question answering (QA). However, evaluating LLMs in the medical field is challenging due …
question answering (QA). However, evaluating LLMs in the medical field is challenging due …
The instruction hierarchy: Training llms to prioritize privileged instructions
Today's LLMs are susceptible to prompt injections, jailbreaks, and other attacks that allow
adversaries to overwrite a model's original instructions with their own malicious prompts. In …
adversaries to overwrite a model's original instructions with their own malicious prompts. In …
StruQ: Defending against prompt injection with structured queries
Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated
applications, which perform text-based tasks by utilizing their advanced language …
applications, which perform text-based tasks by utilizing their advanced language …
A new era in llm security: Exploring security concerns in real-world llm-based systems
Large Language Model (LLM) systems are inherently compositional, with individual LLM
serving as the core foundation with additional layers of objects such as plugins, sandbox …
serving as the core foundation with additional layers of objects such as plugins, sandbox …
R-judge: Benchmarking safety risk awareness for llm agents
Large language models (LLMs) have exhibited great potential in autonomously completing
tasks across real-world applications. Despite this, these LLM agents introduce unexpected …
tasks across real-world applications. Despite this, these LLM agents introduce unexpected …
MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
The deployment of multimodal large language models (MLLMs) has brought forth a unique
vulnerability: susceptibility to malicious attacks through visual inputs. We delve into the novel …
vulnerability: susceptibility to malicious attacks through visual inputs. We delve into the novel …
Injecagent: Benchmarking indirect prompt injections in tool-integrated large language model agents
Recent work has embodied LLMs as agents, allowing them to access tools, perform actions,
and interact with external content (eg, emails or websites). However, external content …
and interact with external content (eg, emails or websites). However, external content …
Prioritizing safeguarding over autonomy: Risks of llm agents for science
Intelligent agents powered by large language models (LLMs) have demonstrated substantial
promise in autonomously conducting experiments and facilitating scientific discoveries …
promise in autonomously conducting experiments and facilitating scientific discoveries …
Llm agents can autonomously exploit one-day vulnerabilities
LLMs have becoming increasingly powerful, both in their benign and malicious uses. With
the increase in capabilities, researchers have been increasingly interested in their ability to …
the increase in capabilities, researchers have been increasingly interested in their ability to …
Strengthening multimodal large language model with bootstrapped preference optimization
Multimodal Large Language Models (MLLMs) excel in generating responses based on
visual inputs. However, they often suffer from a bias towards generating responses similar to …
visual inputs. However, they often suffer from a bias towards generating responses similar to …