Cfbench: A comprehensive constraints-following benchmark for llms

T Zhang, Y Shen, W Luo, Y Zhang, H Liang… - arXiv preprint arXiv …, 2024 - arxiv.org
The adeptness of Large Language Models (LLMs) in comprehending and following natural
language instructions is critical for their deployment in sophisticated real-world applications …

Chinese tiny llm: Pretraining a chinese-centric large language model

X Du, Z Yu, S Gao, D Pan, Y Cheng, Z Ma… - arXiv preprint arXiv …, 2024 - arxiv.org
In this study, we introduce CT-LLM, a 2B large language model (LLM) that illustrates a
pivotal shift towards prioritizing the Chinese language in developing LLMs. Uniquely …

Survey of cultural awareness in language models: Text and beyond

S Pawar, J Park, J Jin, A Arora, J Myung… - arXiv preprint arXiv …, 2024 - arxiv.org
Large-scale deployment of large language models (LLMs) in various applications, such as
chatbots and virtual assistants, requires LLMs to be culturally sensitive to the user to ensure …

CS4: Measuring the Creativity of Large Language Models Automatically by Controlling the Number of Story-Writing Constraints

A Atmakuru, J Nainani, RSR Bheemreddy… - arXiv preprint arXiv …, 2024 - arxiv.org
Evaluating the creativity of large language models (LLMs) in story writing is difficult because
LLM-generated stories could seemingly look creative but be very similar to some existing …

IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization

X Zhang, H Yu, C Fu, F Huang, Y Li - arXiv preprint arXiv:2411.06208, 2024 - arxiv.org
In the realm of large language models (LLMs), the ability of models to accurately follow
instructions is paramount as more agents and applications leverage LLMs for construction …

Latent Learningscape Guided In-context Learning

A Zhou, S Jiang, Y Liu, Y Wu, K Kuang… - Findings of the …, 2024 - aclanthology.org
The growing interest in leveraging large language models is driven by their exceptional
imitation and reasoning capabilities. In-context learning (ICL), a streamlined method, has …

WenMind: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Classical Literature and Language Arts

J Cao, Y Liu, Y Shi, K Ding, L Jin - The Thirty-eight Conference on Neural … - openreview.net
Large Language Models (LLMs) have made significant advancements across numerous
domains, but their capabilities in Chinese Classical Literature and Language Arts (CCLLA) …