Large language models for data annotation: A survey

Z Tan, D Li, S Wang, A Beigi, B Jiang… - arXiv preprint arXiv …, 2024 - arxiv.org
Data annotation generally refers to labeling or generating raw data with relevant
information that can be used to improve the efficacy of machine learning models. The …

A survey on model compression for large language models

X Zhu, J Li, Y Liu, C Ma, W Wang - Transactions of the Association for …, 2024 - direct.mit.edu
Large Language Models (LLMs) have transformed natural language processing
tasks successfully. Yet, their large size and high computational needs pose challenges for …

GraphGPT: Graph instruction tuning for large language models

J Tang, Y Yang, W Wei, L Shi, L Su, S Cheng… - Proceedings of the 47th …, 2024 - dl.acm.org
Graph Neural Networks (GNNs) have evolved to understand graph structures through
recursive exchanges and aggregations among nodes. To enhance robustness, self …

MetaMath: Bootstrap your own mathematical questions for large language models

L Yu, W Jiang, H Shi, J Yu, Z Liu, Y Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have pushed the limits of natural language understanding
and exhibited excellent problem-solving ability. Despite the great success, most existing …

Scaling relationship on learning mathematical reasoning with large language models

Z Yuan, H Yuan, C Li, G Dong, K Lu, C Tan… - arXiv preprint arXiv …, 2023 - arxiv.org
Mathematical reasoning is a challenging task for large language models (LLMs), yet its
scaling relationship with respect to LLM capacity is under-explored. In this paper, we …

Efficient large language models: A survey

Z Wan, X Wang, C Liu, S Alam, Y Zheng, J Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have demonstrated remarkable capabilities in important
tasks such as natural language understanding and language generation, and thus have the …

Large language models as commonsense knowledge for large-scale task planning

Z Zhao, WS Lee, D Hsu - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Large-scale task planning is a major challenge. Recent work exploits large language
models (LLMs) directly as a policy and shows surprisingly interesting results. This paper …

A survey on transformer compression

Y Tang, Y Wang, J Guo, Z Tu, K Han, H Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large models based on the Transformer architecture play increasingly vital roles in artificial
intelligence, particularly within the realms of natural language processing (NLP) and …

Explanations from large language models make small reasoners better

S Li, J Chen, Y Shen, Z Chen, X Zhang, Z Li… - arXiv preprint arXiv …, 2022 - arxiv.org
Integrating free-text explanations to in-context learning of large language models (LLM) is
shown to elicit strong reasoning capabilities along with reasonable explanations. In this …

Lumos: Learning agents with unified data, modular design, and open-source LLMs

D Yin, F Brahman, A Ravichander… - ICLR 2024 Workshop …, 2023 - openreview.net
We introduce Lumos, a novel framework for training language agents that employs a unified
data format and a modular architecture based on open-source large language models …