Foundational challenges in assuring alignment and safety of large language models
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …
language models (LLMs). These challenges are organized into three different categories …
A phase transition in diffusion models reveals the hierarchical nature of data
Understanding the structure of real data is paramount in advancing modern deep-learning
methodologies. Natural data such as images are believed to be composed of features …
methodologies. Natural data such as images are believed to be composed of features …
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks
Fine-tuning large pre-trained models has become the de facto strategy for developing both
task-specific and general-purpose machine learning systems, including developing models …
task-specific and general-purpose machine learning systems, including developing models …
Eclipse: A resource-efficient text-to-image prior for image generations
Abstract Text-to-image (T2I) diffusion models notably the unCLIP models (eg DALL-E-2)
achieve state-of-the-art (SOTA) performance on various compositional T2I benchmarks at …
achieve state-of-the-art (SOTA) performance on various compositional T2I benchmarks at …
Who's in and who's out? A case study of multimodal CLIP-filtering in DataComp
As training datasets become increasingly drawn from unstructured, uncontrolled
environments such as the web, researchers and industry practitioners have increasingly …
environments such as the web, researchers and industry practitioners have increasingly …
How capable can a transformer become? a study on synthetic, interpretable tasks
Transformers trained on huge text corpora exhibit a remarkable set of capabilities, eg,
performing simple logical operations. Given the inherent compositional nature of language …
performing simple logical operations. Given the inherent compositional nature of language …
An analytic theory of creativity in convolutional diffusion models
We obtain the first analytic, interpretable and predictive theory of creativity in convolutional
diffusion models. Indeed, score-based diffusion models can generate highly creative images …
diffusion models. Indeed, score-based diffusion models can generate highly creative images …
Why do animals need shaping? a theory of task composition and curriculum learning
Diverse studies in systems neuroscience begin with extended periods of training known as'
shaping'procedures. These involve progressively studying component parts of more …
shaping'procedures. These involve progressively studying component parts of more …
Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation
The literature on text-to-image generation is plagued by issues of faithfully composing
entities with relations. But there lacks a formal understanding of how entity-relation …
entities with relations. But there lacks a formal understanding of how entity-relation …
Initialization is Critical to Whether Transformers Fit Composite Functions by Inference or Memorizing
Transformers have shown impressive capabilities across various tasks, but their
performance on compositional problems remains a topic of debate. In this work, we …
performance on compositional problems remains a topic of debate. In this work, we …