Explainability for large language models: A survey
Large language models (LLMs) have demonstrated impressive capabilities in natural
language processing. However, their internal mechanisms are still unclear and this lack of …
language processing. However, their internal mechanisms are still unclear and this lack of …
Can large language models reason about medical questions?
Although large language models often produce impressive outputs, it remains unclear how
they perform in real-world scenarios requiring strong reasoning skills and expert domain …
they perform in real-world scenarios requiring strong reasoning skills and expert domain …
Spotting llms with binoculars: Zero-shot detection of machine-generated text
Detecting text generated by modern large language models is thought to be hard, as both
LLMs and humans can exhibit a wide range of complex behaviors. However, we find that a …
LLMs and humans can exhibit a wide range of complex behaviors. However, we find that a …
GP-GPT: Large Language Model for Gene-Phenotype Mapping
Pre-trained large language models (LLMs) have attracted increasing attention in biomedical
domains due to their success in natural language processing. However, the complex traits …
domains due to their success in natural language processing. However, the complex traits …
Informed named entity recognition decoding for generative language models
Ever-larger language models with ever-increasing capabilities are by now well-established
text processing tools. Alas, information extraction tasks such as named entity recognition are …
text processing tools. Alas, information extraction tasks such as named entity recognition are …
Self-specialization: Uncovering latent expertise within large language models
Recent works have demonstrated the effectiveness of self-alignment in which a large
language model is aligned to follow general instructions using instructional data generated …
language model is aligned to follow general instructions using instructional data generated …
If the sources could talk: Evaluating large language models for research assistance in history
GG Garcia, C Weilbach - arXiv preprint arXiv:2310.10808, 2023 - arxiv.org
The recent advent of powerful Large-Language Models (LLM) provides a new
conversational form of inquiry into historical memory (or, training data, in this case). We …
conversational form of inquiry into historical memory (or, training data, in this case). We …
[HTML][HTML] Advancing open domain dialog: The fifth alexa prize socialbot grand challenge
M Johnston, C Flagg, A Gottardi, S Sahai, Y Lu, S Sagi… - 2023 - amazon.science
Creating conversational dialog systems that are able to converse naturally and engagingly
with humans on any topic remains one of the fundamental challenges of artificial …
with humans on any topic remains one of the fundamental challenges of artificial …
Controlled randomness improves the performance of transformer models
T Deußer, C Zhao, W Krämer, D Leonhard… - arXiv preprint arXiv …, 2023 - arxiv.org
During the pre-training step of natural language models, the main objective is to learn a
general representation of the pre-training dataset, usually requiring large amounts of textual …
general representation of the pre-training dataset, usually requiring large amounts of textual …
SRBerta—A Transformer Language Model for Serbian Cyrillic Legal Texts
M Bogdanović, J Kocić, L Stoimenov - Information, 2024 - mdpi.com
Language is a unique ability of human beings. Although relatively simple for humans, the
ability to understand human language is a highly complex task for machines. For a machine …
ability to understand human language is a highly complex task for machines. For a machine …