Large language models for software engineering: A systematic literature review

X Hou, Y Zhao, Y Liu, Z Yang, K Wang, L Li… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have significantly impacted numerous domains, notably
including Software Engineering (SE). Nevertheless, a well-rounded understanding of the …

Benchmarking large language models on cmexam-a comprehensive chinese medical exam dataset

J Liu, P Zhou, Y Hua, D Chong, Z Tian… - Advances in …, 2024 - proceedings.neurips.cc
Recent advancements in large language models (LLMs) have transformed the field of
question answering (QA). However, evaluating LLMs in the medical field is challenging due …

A pathway towards responsible ai generated content

C Chen, J Fu, L Lyu - arXiv preprint arXiv:2303.01325, 2023 - arxiv.org
AI Generated Content (AIGC) has received tremendous attention within the past few years,
with content generated in the format of image, text, audio, video, etc. Meanwhile, AIGC has …

Deepinception: Hypnotize large language model to be jailbreaker

X Li, Z Zhou, J Zhu, J Yao, T Liu, B Han - arXiv preprint arXiv:2311.03191, 2023 - arxiv.org
Despite remarkable success in various applications, large language models (LLMs) are
vulnerable to adversarial jailbreaks that make the safety guardrails void. However, previous …

Benchmarking and defending against indirect prompt injection attacks on large language models

J Yi, Y Xie, B Zhu, K Hines, E Kiciman, G Sun… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent remarkable advancements in large language models (LLMs) have led to their
widespread adoption in various applications. A key feature of these applications is the …

Defending large language models against jailbreaking attacks through goal prioritization

Z Zhang, J Yang, P Ke, M Huang - arXiv preprint arXiv:2311.09096, 2023 - arxiv.org
Large Language Models (LLMs) continue to advance in their capabilities, yet this progress is
accompanied by a growing array of safety risks. While significant attention has been …

Salad-bench: A hierarchical and comprehensive safety benchmark for large language models

L Li, B Dong, R Wang, X Hu, W Zuo, D Lin… - arXiv preprint arXiv …, 2024 - arxiv.org
In the rapidly evolving landscape of Large Language Models (LLMs), ensuring robust safety
measures is paramount. To meet this crucial need, we propose\emph {SALAD-Bench}, a …

Safedecoding: Defending against jailbreak attacks via safety-aware decoding

Z Xu, F Jiang, L Niu, J Jia, BY Lin… - arXiv preprint arXiv …, 2024 - arxiv.org
As large language models (LLMs) become increasingly integrated into real-world
applications such as code generation and chatbot assistance, extensive efforts have been …

ChatGPT in Healthcare from the Perspective of Digital Media: Applications, Opportunities and Challenges

R Xu, Z Wang - Heliyon, 2024 - cell.com
Introduction The emergence and application of generative artificial intelligence/large
language models (hereafter GenAI LLMs) have the potential for significant impact on the …

On large language models' resilience to coercive interrogation

Z Zhang, G Shen, G Tao, S Cheng… - 2024 IEEE Symposium on …, 2024 - computer.org
Abstract Large Language Models (LLMs) are increasingly employed in numerous
applications. It is hence important to ensure that their ethical standard aligns with humans' …