A survey on evaluation of large language models
Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …
industry, owing to their unprecedented performance in various applications. As LLMs …
An AI-Resilient Text Rendering Technique for Reading and Skimming Documents
Readers find text difficult to consume for many reasons. Summarization can address some of
these difficulties, but introduce others, such as omitting, misrepresenting, or hallucinating …
these difficulties, but introduce others, such as omitting, misrepresenting, or hallucinating …
Exploring the prospects and challenges of large language models for language learning and production
LLMs such as GPT-3 (Brown et al., 2020), PaLM (Chowdhery et al., 2022), and LLaMA
(Touvron et al., 2023) consist of large neural networks containing hundreds of billions (or …
(Touvron et al., 2023) consist of large neural networks containing hundreds of billions (or …
Can large language models understand uncommon meanings of common words?
Large language models (LLMs) like ChatGPT have shown significant advancements across
diverse natural language understanding (NLU) tasks, including intelligent dialogue and …
diverse natural language understanding (NLU) tasks, including intelligent dialogue and …
大语言模型评估技术研究进展.
赵睿卓, 曲紫畅, 陈国英, 王坤龙… - … Ju Cai Ji Yu Chu Li, 2024 - search.ebscohost.com
随着大语言模型的广泛应用, 针对大语言模型的评估工作变得至关重要. 除了大语言模型在下游
任务上的表现情况需要评估外, 其存在的一些潜在风险更需要评估, 例如大语言模型可能违背 …
任务上的表现情况需要评估外, 其存在的一些潜在风险更需要评估, 例如大语言模型可能违背 …