Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

Faith and fate: Limits of transformers on compositionality

N Dziri, X Lu, M Sclar, XL Li, L Jiang… - Advances in …, 2024 - proceedings.neurips.cc
Transformer large language models (LLMs) have sparked admiration for their exceptional
performance on tasks that demand intricate multi-step reasoning. Yet, these models …

The rise and potential of large language model based agents: A survey

Z Xi, W Chen, X Guo, W He, Y Ding, B Hong… - arXiv preprint arXiv …, 2023 - arxiv.org
For a long time, humanity has pursued artificial intelligence (AI) equivalent to or surpassing
the human level, with AI agents considered a promising vehicle for this pursuit. AI agents are …

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arXiv preprint arXiv …, 2021 - arxiv.org
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Language models as agent models

J Andreas - arXiv preprint arXiv:2212.01681, 2022 - arxiv.org
Language models (LMs) are trained on collections of documents, written by individual
human agents to achieve specific goals in an outside world. During training, LMs have …

Imitating human behaviour with diffusion models

T Pearce, T Rashid, A Kanervisto, D Bignell… - arXiv preprint arXiv …, 2023 - arxiv.org
Diffusion models have emerged as powerful generative models in the text-to-image domain.
This paper studies their application as observation-to-action models for imitating human …

Embers of autoregression: Understanding large language models through the problem they are trained to solve

RT McCoy, S Yao, D Friedman, M Hardy… - arXiv preprint arXiv …, 2023 - arxiv.org
The widespread adoption of large language models (LLMs) makes it important to recognize
their strengths and limitations. We argue that in order to develop a holistic understanding of …

Likelihood-based diffusion language models

I Gulrajani, TB Hashimoto - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Despite a growing interest in diffusion-based language models, existing work has not shown
that these models can attain nontrivial likelihoods on standard language modeling …

The positive influences of renewable energy consumption on financial development and economic growth

L Zhe, S Yüksel, H Dinçer, S Mukhtarov… - Sage …, 2021 - journals.sagepub.com
This study aims to evaluate the positive impacts of renewable energy usage on the
economic growth and financial development. For this purpose, an evaluation has been …

Hypro: A hybridly normalized probabilistic model for long-horizon prediction of event sequences

S Xue, X Shi, J Zhang, H Mei - Advances in Neural …, 2022 - proceedings.neurips.cc
In this paper, we tackle the important yet under-investigated problem of making long-horizon
prediction of event sequences. Existing state-of-the-art models do not perform well at this …