The capacity for moral self-correction in large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arXiv preprint arXiv …, 2023 - arxiv.org

Large Language Models (LLMs) have shown excellent generalization capabilities that have
led to the development of numerous models. These models propose various new …

被引用次数：180 相关文章所有 3 个版本

[PDF] arxiv.org

Llama 2: Open foundation and fine-tuned chat models

H Touvron, L Martin, K Stone, P Albert… - arXiv preprint arXiv …, 2023 - arxiv.org

In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large
language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine …

被引用次数：5535 相关文章所有 2 个版本

[PDF] neurips.cc

Language models don't always say what they think: unfaithful explanations in chain-of-thought prompting

M Turpin, J Michael, E Perez… - Advances in Neural …, 2024 - proceedings.neurips.cc

Abstract Large Language Models (LLMs) can achieve strong performance on many tasks by
producing step-by-step reasoning before giving a final output, often referred to as chain-of …

被引用次数：175 相关文章所有 4 个版本

[PDF] neurips.cc

Principle-driven self-alignment of language models from scratch with minimal human supervision

Z Sun, Y Shen, Q Zhou, H Zhang… - Advances in …, 2024 - proceedings.neurips.cc

Recent AI-assistant agents, such as ChatGPT, predominantly rely on supervised fine-tuning
(SFT) with human annotations and reinforcement learning from human feedback (RLHF) to …

被引用次数：174 相关文章所有 7 个版本

[PDF] neurips.cc

Language models can solve computer tasks

G Kim, P Baldi, S McAleer - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Agents capable of carrying out general tasks on a computer can improve efficiency and
productivity by automating repetitive tasks and assisting in complex problem-solving. Ideally …

被引用次数：175 相关文章所有 5 个版本

[PDF] arxiv.org

Large language models as optimizers

C Yang, X Wang, Y Lu, H Liu, QV Le, D Zhou… - arXiv preprint arXiv …, 2023 - arxiv.org

Optimization is ubiquitous. While derivative-based algorithms have been powerful tools for
various problems, the absence of gradient imposes challenges on many real-world …

被引用次数：252 相关文章所有 10 个版本

[PDF] arxiv.org

Teaching large language models to self-debug

X Chen, M Lin, N Schärli, D Zhou - arXiv preprint arXiv:2304.05128, 2023 - arxiv.org

Large language models (LLMs) have achieved impressive performance on code generation.
However, for complex programming tasks, generating the correct solution in one go …

被引用次数：276 相关文章所有 4 个版本

[PDF] arxiv.org

Large language models cannot self-correct reasoning yet

J Huang, X Chen, S Mishra, HS Zheng, AW Yu… - arXiv preprint arXiv …, 2023 - arxiv.org

Large Language Models (LLMs) have emerged as a groundbreaking technology with their
unparalleled text generation capabilities across various applications. Nevertheless …

被引用次数：123 相关文章所有 3 个版本

[PDF] arxiv.org

Critic: Large language models can self-correct with tool-interactive critiquing

Z Gou, Z Shao, Y Gong, Y Shen, Y Yang… - arXiv preprint arXiv …, 2023 - arxiv.org

Recent developments in large language models (LLMs) have been impressive. However,
these models sometimes show inconsistencies and problematic behavior, such as …

被引用次数：108 相关文章所有 4 个版本

[HTML] springer.com Full View

[HTML][HTML] A survey of safety and trustworthiness of large language models through the lens of verification and validation

X Huang, W Ruan, W Huang, G Jin, Y Dong… - Artificial Intelligence …, 2024 - Springer

Large language models (LLMs) have exploded a new heatwave of AI for their ability to
engage end-users in human-level conversations with detailed and articulate answers across …

被引用次数：48 相关文章所有 3 个版本