Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Articles citing this work:

Internal consistency and self-feedback in large language models: A survey

X Liang, S Song, Z Zheng, H Wang, Q Yu, X Li… - arXiv preprint arXiv …, 2024 - arxiv.org

Large language models (LLMs) are expected to respond accurately but often exhibit
deficient reasoning or generate hallucinatory content. To address these, studies prefixed …

Cited by 6; 3 versions

LiteSearch: Efficacious Tree Search for LLM

A Wang, L Song, Y Tian, B Peng, D Yu, H Mi… - arXiv preprint arXiv …, 2024 - arxiv.org

Recent research suggests that tree search algorithms (e.g., Monte Carlo Tree Search) can
dramatically boost LLM performance on complex mathematical reasoning tasks. However …

Cited by 3; 2 versions

Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

X Wang, L Song, Y Tian, D Yu, B Peng, H Mi… - arXiv preprint arXiv …, 2024 - arxiv.org

Monte Carlo Tree Search (MCTS) has recently emerged as a powerful technique for
enhancing the reasoning capabilities of LLMs. Techniques such as SFT or DPO have …

Optimizing Task Planning Efficiency in LLMs: Beyond Closed-Loop Systems

L Liu, A Nair, T Peng, S Desai, M Gupta… - Authorea …, 2024 - researchgate.net

Large language models (LLMs) have shown great promise in task execution, but traditional
closed-loop systems limit their planning efficiency. Addressing this challenge, we introduce …