Discovering modular solutions that generalize compositionally

JH Lee, SS Mannelli, A Saxe - arXiv preprint arXiv:2402.18361, 2024 - arxiv.org

Diverse studies in systems neuroscience begin with extended periods of training known as'
shaping'procedures. These involve progressively studying component parts of more …

被引用次数：5 相关文章所有 3 个版本

[PDF] arxiv.org

Don't Cut Corners: Exact Conditions for Modularity in Biologically Inspired Representations

W Dorrell, K Hsu, L Hollingsworth, JH Lee, J Wu… - arXiv preprint arXiv …, 2024 - arxiv.org

Why do biological and artificial neurons sometimes modularise, each encoding a single
meaningful variable, and sometimes entangle their representation of many variables? In this …

相关文章所有 2 个版本

[PDF] arxiv.org

A Complexity-Based Theory of Compositionality

E Elmoznino, T Jiralerspong, Y Bengio… - arXiv preprint arXiv …, 2024 - arxiv.org

Compositionality is believed to be fundamental to intelligence. In humans, it underlies the
structure of thought, language, and higher-level reasoning. In AI, compositional …

相关文章所有 2 个版本

[PDF] arxiv.org

Flexible task abstractions emerge in linear networks with fast and bounded units

K Sandbrink, JP Bauer, AM Proca, AM Saxe… - arXiv preprint arXiv …, 2024 - arxiv.org

Animals survive in dynamic environments changing at arbitrary timescales, but such data
distribution shifts are a challenge to neural networks. To adapt to change, neural systems …

相关文章所有 2 个版本

[PDF] arxiv.org

Compositional Risk Minimization

D Mahajan, M Pezeshki, I Mitliagkas, K Ahuja… - arXiv preprint arXiv …, 2024 - arxiv.org

In this work, we tackle a challenging and extreme form of subpopulation shift, which is
termed compositional shift. Under compositional shifts, some combinations of attributes are …

相关文章所有 2 个版本

[PDF] arxiv.org

When can transformers compositionally generalize in-context?

S Kobayashi, S Schug, Y Akram, F Redhardt… - arXiv preprint arXiv …, 2024 - arxiv.org

Many tasks can be composed from a few independent components. This gives rise to a
combinatorial explosion of possible tasks, only some of which might be encountered during …

相关文章所有 2 个版本

[PDF] arxiv.org

When does compositional structure yield compositional generalization? A kernel theory

S Lippl, K Stachenfeld - arXiv preprint arXiv:2405.16391, 2024 - arxiv.org

Compositional generalization (the ability to respond correctly to novel combinations of
familiar components) is thought to be a cornerstone of intelligent behavior. Compositionally …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Enhancing Generalization in Sparse Mixture of Experts Models: The Case for Increased Expert Activation in Compositional Tasks

J Zhao - arXiv preprint arXiv:2410.13964, 2024 - arxiv.org

As Transformer models grow in complexity, their ability to generalize to novel, compositional
tasks becomes crucial. This study challenges conventional wisdom about sparse activation …

相关文章所有 2 个版本

[PDF] openreview.net

Modularity in Biologically Inspired Representations Depends on Task Variable Range Independence

W Dorrell, K Hsu, L Hollingsworth, JH Lee, J Wu… - ICML 2024 Workshop on … - openreview.net

Artificial and biological neurons sometimes modularise into disjoint groups each encoding a
single meaningful variable; at other times they entangle the representation of many …