A survey of large language models
Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …
Datasets for large language models: A comprehensive survey
This paper embarks on an exploration into the Large Language Model (LLM) datasets,
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …
Inadequacies of large language model benchmarks in the era of generative artificial intelligence
The rapid rise in popularity of Large Language Models (LLMs) with emerging capabilities
has spurred public curiosity to evaluate and compare different LLMs, leading many …
has spurred public curiosity to evaluate and compare different LLMs, leading many …
Task contamination: Language models may not be few-shot anymore
C Li, J Flanigan - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Large language models (LLMs) offer impressive performance in various zero-shot and few-
shot tasks. However, their success in zero-shot or few-shot settings may be affected by task …
shot tasks. However, their success in zero-shot or few-shot settings may be affected by task …
How much are llms contaminated? a comprehensive survey and the llmsanitize library
With the rise of Large Language Models (LLMs) in recent years, new opportunities are
emerging, but also new challenges, and contamination is quickly becoming critical …
emerging, but also new challenges, and contamination is quickly becoming critical …
Solar 10.7 b: Scaling large language models with simple yet effective depth up-scaling
We introduce SOLAR 10.7 B, a large language model (LLM) with 10.7 billion parameters,
demonstrating superior performance in various natural language processing (NLP) tasks …
demonstrating superior performance in various natural language processing (NLP) tasks …
Cambrian-1: A fully open, vision-centric exploration of multimodal llms
We introduce Cambrian-1, a family of multimodal LLMs (MLLMs) designed with a vision-
centric approach. While stronger language models can enhance multimodal capabilities, the …
centric approach. While stronger language models can enhance multimodal capabilities, the …
An llm-free multi-dimensional benchmark for mllms hallucination evaluation
Despite making significant progress in multi-modal tasks, current Multi-modal Large
Language Models (MLLMs) encounter the significant challenge of hallucination, which may …
Language Models (MLLMs) encounter the significant challenge of hallucination, which may …
Ideal: Influence-driven selective annotations empower in-context learners in large language models
In-context learning is a promising paradigm that utilizes in-context examples as prompts for
the predictions of large language models. These prompts are crucial for achieving strong …
the predictions of large language models. These prompts are crucial for achieving strong …
Promptbench: A unified library for evaluation of large language models
The evaluation of large language models (LLMs) is crucial to assess their performance and
mitigate potential security risks. In this paper, we introduce PromptBench, a unified library to …
mitigate potential security risks. In this paper, we introduce PromptBench, a unified library to …