Explainability for large language models: A survey

H Zhao, H Chen, F Yang, N Liu, H Deng, H Cai… - ACM Transactions on …, 2024 - dl.acm.org
Large language models (LLMs) have demonstrated impressive capabilities in natural
language processing. However, their internal mechanisms are still unclear and this lack of …

Can large language models reason about medical questions?

V Liévin, CE Hother, AG Motzfeldt, O Winther - Patterns, 2024 - cell.com
Although large language models often produce impressive outputs, it remains unclear how
they perform in real-world scenarios requiring strong reasoning skills and expert domain …

Spotting llms with binoculars: Zero-shot detection of machine-generated text

A Hans, A Schwarzschild, V Cherepanova… - arXiv preprint arXiv …, 2024 - arxiv.org
Detecting text generated by modern large language models is thought to be hard, as both
LLMs and humans can exhibit a wide range of complex behaviors. However, we find that a …

GP-GPT: Large Language Model for Gene-Phenotype Mapping

Y Lyu, Z Wu, L Zhang, J Zhang, Y Li, W Ruan… - arXiv preprint arXiv …, 2024 - arxiv.org
Pre-trained large language models (LLMs) have attracted increasing attention in biomedical
domains due to their success in natural language processing. However, the complex traits …

Informed named entity recognition decoding for generative language models

T Deußer, L Hillebrand, C Bauckhage… - arXiv preprint arXiv …, 2023 - arxiv.org
Ever-larger language models with ever-increasing capabilities are by now well-established
text processing tools. Alas, information extraction tasks such as named entity recognition are …

Self-specialization: Uncovering latent expertise within large language models

J Kang, H Luo, Y Zhu, J Hansen, J Glass… - Findings of the …, 2024 - aclanthology.org
Recent works have demonstrated the effectiveness of self-alignment in which a large
language model is aligned to follow general instructions using instructional data generated …

If the sources could talk: Evaluating large language models for research assistance in history

GG Garcia, C Weilbach - arXiv preprint arXiv:2310.10808, 2023 - arxiv.org
The recent advent of powerful Large-Language Models (LLM) provides a new
conversational form of inquiry into historical memory (or, training data, in this case). We …

[HTML][HTML] Advancing open domain dialog: The fifth alexa prize socialbot grand challenge

M Johnston, C Flagg, A Gottardi, S Sahai, Y Lu, S Sagi… - 2023 - amazon.science
Creating conversational dialog systems that are able to converse naturally and engagingly
with humans on any topic remains one of the fundamental challenges of artificial …

Controlled randomness improves the performance of transformer models

T Deußer, C Zhao, W Krämer, D Leonhard… - arXiv preprint arXiv …, 2023 - arxiv.org
During the pre-training step of natural language models, the main objective is to learn a
general representation of the pre-training dataset, usually requiring large amounts of textual …

SRBerta—A Transformer Language Model for Serbian Cyrillic Legal Texts

M Bogdanović, J Kocić, L Stoimenov - Information, 2024 - mdpi.com
Language is a unique ability of human beings. Although relatively simple for humans, the
ability to understand human language is a highly complex task for machines. For a machine …