Large language models for data annotation: A survey
Data annotation generally refers to the labeling or generating of raw data with relevant
information, which could be used for improving the efficacy of machine learning models. The …
information, which could be used for improving the efficacy of machine learning models. The …
A survey on model compression for large language models
Abstract Large Language Models (LLMs) have transformed natural language processing
tasks successfully. Yet, their large size and high computational needs pose challenges for …
tasks successfully. Yet, their large size and high computational needs pose challenges for …
Graphgpt: Graph instruction tuning for large language models
Graph Neural Networks (GNNs) have evolved to understand graph structures through
recursive exchanges and aggregations among nodes. To enhance robustness, self …
recursive exchanges and aggregations among nodes. To enhance robustness, self …
Metamath: Bootstrap your own mathematical questions for large language models
Large language models (LLMs) have pushed the limits of natural language understanding
and exhibited excellent problem-solving ability. Despite the great success, most existing …
and exhibited excellent problem-solving ability. Despite the great success, most existing …
Scaling relationship on learning mathematical reasoning with large language models
Mathematical reasoning is a challenging task for large language models (LLMs), while the
scaling relationship of it with respect to LLM capacity is under-explored. In this paper, we …
scaling relationship of it with respect to LLM capacity is under-explored. In this paper, we …
Efficient large language models: A survey
Large Language Models (LLMs) have demonstrated remarkable capabilities in important
tasks such as natural language understanding and language generation, and thus have the …
tasks such as natural language understanding and language generation, and thus have the …
Large language models as commonsense knowledge for large-scale task planning
Large-scale task planning is a major challenge. Recent work exploits large language
models (LLMs) directly as a policy and shows surprisingly interesting results. This paper …
models (LLMs) directly as a policy and shows surprisingly interesting results. This paper …
A survey on transformer compression
Large models based on the Transformer architecture play increasingly vital roles in artificial
intelligence, particularly within the realms of natural language processing (NLP) and …
intelligence, particularly within the realms of natural language processing (NLP) and …
Explanations from large language models make small reasoners better
Integrating free-text explanations to in-context learning of large language models (LLM) is
shown to elicit strong reasoning capabilities along with reasonable explanations. In this …
shown to elicit strong reasoning capabilities along with reasonable explanations. In this …
Lumos: Learning agents with unified data, modular design, and open-source llms
We introduce Lumos, a novel framework for training language agents that employs a unified
data format and a modular architecture based on open-source large language models …
data format and a modular architecture based on open-source large language models …