Challenges and applications of large language models
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …
Faith and fate: Limits of transformers on compositionality
Transformer large language models (LLMs) have sparked admiration for their exceptional
performance on tasks that demand intricate multi-step reasoning. Yet, these models …
performance on tasks that demand intricate multi-step reasoning. Yet, these models …
The rise and potential of large language model based agents: A survey
For a long time, humanity has pursued artificial intelligence (AI) equivalent to or surpassing
the human level, with AI agents considered a promising vehicle for this pursuit. AI agents are …
the human level, with AI agents considered a promising vehicle for this pursuit. AI agents are …
On the opportunities and risks of foundation models
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
Language models as agent models
J Andreas - arXiv preprint arXiv:2212.01681, 2022 - arxiv.org
Language models (LMs) are trained on collections of documents, written by individual
human agents to achieve specific goals in an outside world. During training, LMs have …
human agents to achieve specific goals in an outside world. During training, LMs have …
Imitating human behaviour with diffusion models
Diffusion models have emerged as powerful generative models in the text-to-image domain.
This paper studies their application as observation-to-action models for imitating human …
This paper studies their application as observation-to-action models for imitating human …
Embers of autoregression: Understanding large language models through the problem they are trained to solve
The widespread adoption of large language models (LLMs) makes it important to recognize
their strengths and limitations. We argue that in order to develop a holistic understanding of …
their strengths and limitations. We argue that in order to develop a holistic understanding of …
Likelihood-based diffusion language models
I Gulrajani, TB Hashimoto - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Despite a growing interest in diffusion-based language models, existing work has not shown
that these models can attain nontrivial likelihoods on standard language modeling …
that these models can attain nontrivial likelihoods on standard language modeling …
The positive influences of renewable energy consumption on financial development and economic growth
This study aims to evaluate the positive impacts of renewable energy usage on the
economic growth and financial development. For this purpose, an evaluation has been …
economic growth and financial development. For this purpose, an evaluation has been …
Hypro: A hybridly normalized probabilistic model for long-horizon prediction of event sequences
In this paper, we tackle the important yet under-investigated problem of making long-horizon
prediction of event sequences. Existing state-of-the-art models do not perform well at this …
prediction of event sequences. Existing state-of-the-art models do not perform well at this …