A survey on evaluation of large language models
Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …
industry, owing to their unprecedented performance in various applications. As LLMs …
Evaluating large language models: A comprehensive survey
Large language models (LLMs) have demonstrated remarkable capabilities across a broad
spectrum of tasks. They have attracted significant attention and been deployed in numerous …
spectrum of tasks. They have attracted significant attention and been deployed in numerous …
A survey on large language models: Applications, challenges, limitations, and practical usage
Within the vast expanse of computerized language processing, a revolutionary entity known
as Large Language Models (LLMs) has emerged, wielding immense power in its capacity to …
as Large Language Models (LLMs) has emerged, wielding immense power in its capacity to …
Large language models: a comprehensive survey of its applications, challenges, limitations, and future prospects
Within the vast expanse of computerized language processing, a revolutionary entity known
as Large Language Models (LLMs) has emerged, wielding immense power in its capacity to …
as Large Language Models (LLMs) has emerged, wielding immense power in its capacity to …
[PDF][PDF] Efficient large language models: A survey
Abstract Large Language Models (LLMs) have demonstrated remarkable capabilities in
important tasks such as natural language understanding, language generation, and …
important tasks such as natural language understanding, language generation, and …
Dyval: Graph-informed dynamic evaluation of large language models
Large language models (LLMs) have achieved remarkable performance in various
evaluation benchmarks. However, concerns about their performance are raised on potential …
evaluation benchmarks. However, concerns about their performance are raised on potential …
Benchmarking llms via uncertainty quantification
The proliferation of open-source Large Language Models (LLMs) from various institutions
has highlighted the urgent need for comprehensive evaluation methods. However, current …
has highlighted the urgent need for comprehensive evaluation methods. However, current …
Are large language model-based evaluators the solution to scaling up multilingual evaluation?
Large Language Models (LLMs) have demonstrated impressive performance on Natural
Language Processing (NLP) tasks, such as Question Answering, Summarization, and …
Language Processing (NLP) tasks, such as Question Answering, Summarization, and …
Towards an understanding of large language models in software engineering tasks
Large Language Models (LLMs) have drawn widespread attention and research due to their
astounding performance in tasks such as text generation and reasoning. Derivative …
astounding performance in tasks such as text generation and reasoning. Derivative …
State of what art? a call for multi-prompt llm evaluation
Recent advances in large language models (LLMs) have led to the development of various
evaluation benchmarks. These benchmarks typically rely on a single instruction template for …
evaluation benchmarks. These benchmarks typically rely on a single instruction template for …