Towards reasoning in large language models: A survey

J Huang, KCC Chang - arXiv preprint arXiv:2212.10403, 2022 - arxiv.org
Reasoning is a fundamental aspect of human intelligence that plays a crucial role in
activities such as problem solving, decision making, and critical thinking. In recent years …

MetaMath: Bootstrap your own mathematical questions for large language models

L Yu, W Jiang, H Shi, J Yu, Z Liu, Y Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have pushed the limits of natural language understanding
and exhibited excellent problem-solving ability. Despite the great success, most existing …

Specializing smaller language models towards multi-step reasoning

Y Fu, H Peng, L Ou, A Sabharwal… - … on Machine Learning, 2023 - proceedings.mlr.press
The surprising ability of Large Language Models (LLMs) to perform well on complex
reasoning with only few-shot chain-of-thought prompts is believed to emerge only in very …
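
To make the prompting style this entry studies concrete, here is a minimal Python sketch of a few-shot chain-of-thought prompt. The worked demonstration is the canonical tennis-ball example from the CoT literature; the target question and helper are illustrative only, not this paper's pipeline.

```python
# Minimal sketch of a few-shot chain-of-thought (CoT) prompt: a worked
# demonstration with explicit intermediate steps, then the new question.
# The demonstration is the canonical tennis-ball example from the CoT
# literature; the target question below is illustrative only.

FEW_SHOT_COT = """\
Q: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. Each can
has 3 tennis balls. How many tennis balls does he have now?
A: Roger started with 5 balls. 2 cans of 3 tennis balls each is 6 tennis
balls. 5 + 6 = 11. The answer is 11.

Q: {question}
A:"""

def build_prompt(question: str) -> str:
    # The completion the model returns should mimic the demonstration:
    # intermediate steps first, final answer last.
    return FEW_SHOT_COT.format(question=question)

print(build_prompt("A baker bakes 4 trays of 12 rolls each. How many rolls?"))
```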

Reasoning with language model prompting: A survey

S Qiao, Y Ou, N Zhang, X Chen, Y Yao, S Deng… - arXiv preprint arXiv …, 2022 - arxiv.org
Reasoning, as an essential ability for complex problem-solving, can provide back-end
support for various real-world applications, such as medical diagnosis, negotiation, etc. This …

Large language models are reasoning teachers

N Ho, L Schmid, SY Yun - arXiv preprint arXiv:2212.10071, 2022 - arxiv.org
Recent works have shown that chain-of-thought (CoT) prompting can elicit language models
to solve complex reasoning tasks step by step. However, prompt-based CoT methods are …
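
This entry and several below share a teacher-student recipe: a large model writes step-by-step rationales, and a small model is fine-tuned on them. A hedged sketch of that general pattern follows; `teacher_generate` is a hypothetical stand-in for a large-model call, and the correctness filter is a common choice rather than any single paper's exact pipeline.

```python
# Sketch of the teacher-student CoT distillation recipe: a large teacher
# writes step-by-step rationales, and a small student is fine-tuned on the
# question -> rationale pairs that reach the gold answer.

from typing import Callable

def build_finetune_set(
    questions: list[str],
    answers: list[str],
    teacher_generate: Callable[[str], str],  # hypothetical large-model call
) -> list[dict[str, str]]:
    examples = []
    for q, a in zip(questions, answers):
        rationale = teacher_generate(f"Q: {q}\nA: Let's think step by step.")
        # Common filter: keep only rationales ending in the gold answer, so
        # the student never trains on reasoning that led somewhere wrong.
        if rationale.strip().endswith(a):
            examples.append({"prompt": f"Q: {q}\nA:", "completion": rationale})
    return examples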

Teaching small language models to reason

LC Magister, J Mallinson, J Adamek, E Malmi… - arXiv preprint arXiv …, 2022 - arxiv.org
Chain of thought prompting successfully improves the reasoning capabilities of large
language models, achieving state-of-the-art results on a range of datasets. However, these …

A survey on model compression for large language models

X Zhu, J Li, Y Liu, C Ma, W Wang - arXiv preprint arXiv:2308.07633, 2023 - arxiv.org
Large Language Models (LLMs) have revolutionized natural language processing tasks with
remarkable success. However, their formidable size and computational demands present …

A survey on transformer compression

Y Tang, Y Wang, J Guo, Z Tu, K Han, H Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large models based on the Transformer architecture play increasingly vital roles in artificial
intelligence, particularly within the realms of natural language processing (NLP) and …
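
Both compression surveys above cover quantization alongside pruning, distillation, and low-rank factorization. As one concrete instance, here is a minimal NumPy sketch of symmetric int8 weight quantization; it illustrates the general idea, not any particular method from these surveys.

```python
import numpy as np

def quantize_int8(w: np.ndarray) -> tuple[np.ndarray, float]:
    # Symmetric per-tensor quantization: a single float scale maps int8
    # codes back to real values, with max(|w|) landing on +/-127.
    scale = max(float(np.abs(w).max()) / 127.0, 1e-12)  # guard all-zero tensors
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale

w = np.random.randn(256, 256).astype(np.float32)
q, s = quantize_int8(w)
err = np.abs(w - dequantize_int8(q, s)).max()
print(f"max reconstruction error: {err:.4f}")  # small relative to |w|
```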

Distilling reasoning capabilities into smaller language models

K Shridhar, A Stolfo, M Sachan - arXiv preprint arXiv:2212.00193, 2022 - arxiv.org
Step-by-step reasoning approaches like chain of thought (CoT) have proved to be very
effective in inducing reasoning capabilities in large language models. However, the success …

Symbolic chain-of-thought distillation: Small models can also "think" step-by-step

LH Li, J Hessel, Y Yu, X Ren, KW Chang… - arXiv preprint arXiv …, 2023 - arxiv.org
Chain-of-thought prompting (e.g., "Let's think step-by-step") primes large language models to
verbalize rationalization for their predictions. While chain-of-thought can lead to dramatic …