A survey on evaluation of large language models

Y Chang, X Wang, J Wang, Y Wu, L Yang… - ACM Transactions on …, 2024 - dl.acm.org
Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …

Exploring the frontiers of llms in psychological applications: A comprehensive review

L Ke, S Tong, P Cheng, K Peng - arXiv preprint arXiv:2401.01519, 2024 - arxiv.org
This paper explores the frontiers of large language models (LLMs) in psychology
applications. Psychology has undergone several theoretical changes, and the current use of …

A survey of large language models for healthcare: from data, technology, and applications to accountability and ethics

K He, R Mao, Q Lin, Y Ruan, X Lan, M Feng… - arXiv preprint arXiv …, 2023 - arxiv.org
The utilization of large language models (LLMs) in the Healthcare domain has generated
both excitement and concern due to their ability to effectively respond to freetext queries with …

Evaluating large language models for radiology natural language processing

Z Liu, T Zhong, Y Li, Y Zhang, Y Pan, Z Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
The rise of large language models (LLMs) has marked a pivotal shift in the field of natural
language processing (NLP). LLMs have revolutionized a multitude of domains, and they …

Security and privacy challenges of large language models: A survey

BC Das, MH Amini, Y Wu - arXiv preprint arXiv:2402.00888, 2024 - arxiv.org
Large Language Models (LLMs) have demonstrated extraordinary capabilities and
contributed to multiple fields, such as generating and summarizing text, language …

Large language models for uavs: Current state and pathways to the future

S Javaid, H Fahim, B He… - IEEE Open Journal of …, 2024 - ieeexplore.ieee.org
Unmanned Aerial Vehicles (UAVs) have emerged as a transformative technology across
diverse sectors, offering adaptable solutions to complex challenges in both military and …

Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench

J Huang, W Wang, EJ Li, MH Lam, S Ren… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have recently showcased their remarkable capacities, not
only in natural language processing tasks but also across diverse domains such as clinical …

Understanding and improving fairness in cognitive diagnosis

Z Zhang, L Wu, Q Liu, J Liu, Z Huang, Y Yin… - Science China …, 2024 - Springer
Intelligent education is a significant application of artificial intelligence. One of the key
research topics in intelligence education is cognitive diagnosis, which aims to gauge the …

Context-aware code generation framework for code repositories: Local, global, and third-party library awareness

D Liao, S Pan, Q Huang, X Ren, Z Xing, H Jin… - arXiv preprint arXiv …, 2023 - arxiv.org
Code generation tools are essential to help developers in the software development
process. Existing tools often disconnect with the working context, ie, the code repository …

FairLISA: Fair User Modeling with Limited Sensitive Attributes Information

Q Liu, H Jiang, F Wang, Y Zhuang… - Advances in …, 2024 - proceedings.neurips.cc
User modeling techniques profile users' latent characteristics (eg, preference) from their
observed behaviors, and play a crucial role in decision-making. Unfortunately, traditional …