Falcon-40B: an open large language model with state-of-the-art performance. 2023

H Zhao, H Chen, F Yang, N Liu, H Deng, H Cai… - ACM Transactions on …, 2024 - dl.acm.org

Large language models (LLMs) have demonstrated impressive capabilities in natural
language processing. However, their internal mechanisms are still unclear and this lack of …

被引用次数：255 相关文章所有 5 个版本

[PDF] cell.com Full View

Can large language models reason about medical questions?

V Liévin, CE Hother, AG Motzfeldt, O Winther - Patterns, 2024 - cell.com

Although large language models often produce impressive outputs, it remains unclear how
they perform in real-world scenarios requiring strong reasoning skills and expert domain …

被引用次数：216 相关文章所有 9 个版本

[PDF] arxiv.org

Spotting llms with binoculars: Zero-shot detection of machine-generated text

A Hans, A Schwarzschild, V Cherepanova… - arXiv preprint arXiv …, 2024 - arxiv.org

Detecting text generated by modern large language models is thought to be hard, as both
LLMs and humans can exhibit a wide range of complex behaviors. However, we find that a …

被引用次数：16 相关文章所有 4 个版本

[PDF] arxiv.org

GP-GPT: Large Language Model for Gene-Phenotype Mapping

Y Lyu, Z Wu, L Zhang, J Zhang, Y Li, W Ruan… - arXiv preprint arXiv …, 2024 - arxiv.org

Pre-trained large language models (LLMs) have attracted increasing attention in biomedical
domains due to their success in natural language processing. However, the complex traits …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Informed named entity recognition decoding for generative language models

T Deußer, L Hillebrand, C Bauckhage… - arXiv preprint arXiv …, 2023 - arxiv.org

Ever-larger language models with ever-increasing capabilities are by now well-established
text processing tools. Alas, information extraction tasks such as named entity recognition are …

被引用次数：4 相关文章所有 2 个版本

[PDF] aclanthology.org

Self-specialization: Uncovering latent expertise within large language models

J Kang, H Luo, Y Zhu, J Hansen, J Glass… - Findings of the …, 2024 - aclanthology.org

Recent works have demonstrated the effectiveness of self-alignment in which a large
language model is aligned to follow general instructions using instructional data generated …

被引用次数：4 相关文章所有 4 个版本

[PDF] arxiv.org

If the sources could talk: Evaluating large language models for research assistance in history

GG Garcia, C Weilbach - arXiv preprint arXiv:2310.10808, 2023 - arxiv.org

The recent advent of powerful Large-Language Models (LLM) provides a new
conversational form of inquiry into historical memory (or, training data, in this case). We …

被引用次数：6 相关文章所有 2 个版本

[HTML] amazon.science

[HTML][HTML] Advancing open domain dialog: The fifth alexa prize socialbot grand challenge

M Johnston, C Flagg, A Gottardi, S Sahai, Y Lu, S Sagi… - 2023 - amazon.science

Creating conversational dialog systems that are able to converse naturally and engagingly
with humans on any topic remains one of the fundamental challenges of artificial …

被引用次数：9 相关文章所有 3 个版本

[PDF] arxiv.org

Controlled randomness improves the performance of transformer models

T Deußer, C Zhao, W Krämer, D Leonhard… - arXiv preprint arXiv …, 2023 - arxiv.org

During the pre-training step of natural language models, the main objective is to learn a
general representation of the pre-training dataset, usually requiring large amounts of textual …

被引用次数：1 相关文章所有 5 个版本

[PDF] mdpi.com

SRBerta—A Transformer Language Model for Serbian Cyrillic Legal Texts

M Bogdanović, J Kocić, L Stoimenov - Information, 2024 - mdpi.com

Language is a unique ability of human beings. Although relatively simple for humans, the
ability to understand human language is a highly complex task for machines. For a machine …

被引用次数：1 相关文章所有 4 个版本