A bibliometric review of large language models research from 2017 to 2023
Large language models (LLMs) are a class of language models that have demonstrated
outstanding performance across a range of natural language processing (NLP) tasks and …
outstanding performance across a range of natural language processing (NLP) tasks and …
Trends and challenges of real-time learning in large language models: A critical review
M Jovanovic, P Voss - arXiv preprint arXiv:2404.18311, 2024 - arxiv.org
Real-time learning concerns the ability of learning systems to acquire knowledge over time,
enabling their adaptation and generalization to novel tasks. It is a critical ability for …
enabling their adaptation and generalization to novel tasks. It is a critical ability for …
Dyval 2: Dynamic evaluation of large language models by meta probing agents
Evaluation of large language models (LLMs) has raised great concerns in the community
due to the issue of data contamination. Existing work designed evaluation protocols using …
due to the issue of data contamination. Existing work designed evaluation protocols using …
On catastrophic inheritance of large foundation models
Large foundation models (LFMs) are claiming incredible performances. Yet great concerns
have been raised about their mythic and uninterpreted potentials not only in machine …
have been raised about their mythic and uninterpreted potentials not only in machine …
Dynamic Evaluation of Large Language Models by Meta Probing Agents
Evaluation of large language models (LLMs) has raised great concerns in the community
due to the issue of data contamination. Existing work designed evaluation protocols using …
due to the issue of data contamination. Existing work designed evaluation protocols using …
NPHardEval4V: A Dynamic Reasoning Benchmark of Multimodal Large Language Models
Understanding the reasoning capabilities of Multimodal Large Language Models (MLLMs) is
an important area of research. In this study, we introduce a dynamic benchmark …
an important area of research. In this study, we introduce a dynamic benchmark …
CausalBench: A Comprehensive Benchmark for Causal Learning Capability of Large Language Models
Causality reveals fundamental principles behind data distributions in real-world scenarios,
and the capability of large language models (LLMs) to understand causality directly impacts …
and the capability of large language models (LLMs) to understand causality directly impacts …
MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures
Evaluating large language models (LLMs) is challenging. Traditional ground-truth-based
benchmarks fail to capture the comprehensiveness and nuance of real-world queries, while …
benchmarks fail to capture the comprehensiveness and nuance of real-world queries, while …
Large Language Models in Biomedical and Health Informatics: A Bibliometric Review
Large Language Models (LLMs) have rapidly become important tools in Biomedical and
Health Informatics (BHI), enabling new ways to analyze data, treat patients, and conduct …
Health Informatics (BHI), enabling new ways to analyze data, treat patients, and conduct …
Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities
This study intends to systematically disentangle pure logic reasoning and text understanding
by investigating the contrast across abstract and contextualized logical problems from a …
by investigating the contrast across abstract and contextualized logical problems from a …