Why do animals need shaping? A theory of task composition and curriculum learning

JH Lee, SS Mannelli, A Saxe - arXiv preprint arXiv:2402.18361, 2024 - arxiv.org
Diverse studies in systems neuroscience begin with extended periods of training known as
'shaping' procedures. These involve progressively studying component parts of more …

Don't Cut Corners: Exact Conditions for Modularity in Biologically Inspired Representations

W Dorrell, K Hsu, L Hollingsworth, JH Lee, J Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
Why do biological and artificial neurons sometimes modularise, each encoding a single
meaningful variable, and sometimes entangle their representation of many variables? In this …

A Complexity-Based Theory of Compositionality

E Elmoznino, T Jiralerspong, Y Bengio… - arXiv preprint arXiv …, 2024 - arxiv.org
Compositionality is believed to be fundamental to intelligence. In humans, it underlies the
structure of thought, language, and higher-level reasoning. In AI, compositional …

Flexible task abstractions emerge in linear networks with fast and bounded units

K Sandbrink, JP Bauer, AM Proca, AM Saxe… - arXiv preprint arXiv …, 2024 - arxiv.org
Animals survive in dynamic environments changing at arbitrary timescales, but such data
distribution shifts are a challenge to neural networks. To adapt to change, neural systems …

Compositional Risk Minimization

D Mahajan, M Pezeshki, I Mitliagkas, K Ahuja… - arXiv preprint arXiv …, 2024 - arxiv.org
In this work, we tackle a challenging and extreme form of subpopulation shift, which is
termed compositional shift. Under compositional shifts, some combinations of attributes are …

When can transformers compositionally generalize in-context?

S Kobayashi, S Schug, Y Akram, F Redhardt… - arXiv preprint arXiv …, 2024 - arxiv.org
Many tasks can be composed from a few independent components. This gives rise to a
combinatorial explosion of possible tasks, only some of which might be encountered during …

When does compositional structure yield compositional generalization? A kernel theory

S Lippl, K Stachenfeld - arXiv preprint arXiv:2405.16391, 2024 - arxiv.org
Compositional generalization (the ability to respond correctly to novel combinations of
familiar components) is thought to be a cornerstone of intelligent behavior. Compositionally …

Enhancing Generalization in Sparse Mixture of Experts Models: The Case for Increased Expert Activation in Compositional Tasks

J Zhao - arXiv preprint arXiv:2410.13964, 2024 - arxiv.org
As Transformer models grow in complexity, their ability to generalize to novel, compositional
tasks becomes crucial. This study challenges conventional wisdom about sparse activation …

Modularity in Biologically Inspired Representations Depends on Task Variable Range Independence

W Dorrell, K Hsu, L Hollingsworth, JH Lee, J Wu… - ICML 2024 Workshop on … - openreview.net
Artificial and biological neurons sometimes modularise into disjoint groups each encoding a
single meaningful variable; at other times they entangle the representation of many …