- Academic resource search

[PDF] royalsocietypublishing.org

Inductive biases for deep learning of higher-level cognition

A Goyal, Y Bengio - Proceedings of the Royal Society A, 2022 - royalsocietypublishing.org

A fascinating hypothesis is that human and animal intelligence could be explained by a few
principles (rather than an encyclopaedic list of heuristics). If that hypothesis was correct, we …

Cited by 383 | Related articles | All 5 versions

[PDF] arxiv.org

PredRNN: A recurrent neural network for spatiotemporal predictive learning

Y Wang, H Wu, J Zhang, Z Gao, J Wang… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org

The predictive learning of spatiotemporal sequences aims to generate future images by
learning from the historical context, where the visual dynamics are believed to have modular …

Cited by 393 | Related articles | All 6 versions

[PDF] neurips.cc

Neural production systems

AG ALIAS PARTH GOYAL, A Didolkar… - Advances in …, 2021 - proceedings.neurips.cc

Visual environments are structured, consisting of distinct objects or entities. These entities
have properties---visible or latent---that determine the manner in which they interact with one …

Cited by 87 | Related articles | All 9 versions

[PDF] neurips.cc

SIMONe: View-invariant, temporally-abstracted object representations via unsupervised video decomposition

R Kabra, D Zoran, G Erdogan… - Advances in …, 2021 - proceedings.neurips.cc

To help agents reason about scenes in terms of their building blocks, we wish to extract the
compositional structure of any given scene (in particular, the configuration and …

Cited by 74 | Related articles | All 7 versions

[PDF] thecvf.com

PARTS: Unsupervised segmentation with slots, attention and independence maximization

D Zoran, R Kabra, A Lerchner… - Proceedings of the …, 2021 - openaccess.thecvf.com

From an early age, humans perceive the visual world as composed of coherent objects with
distinctive properties such as shape, size, and color. There is great interest in building …

Cited by 49 | Related articles | All 3 versions

[PDF] neurips.cc

Iso-Dream: Isolating and leveraging noncontrollable visual dynamics in world models

M Pan, X Zhu, Y Wang, X Yang - Advances in neural …, 2022 - proceedings.neurips.cc

World models learn the consequences of actions in vision-based interactive systems.
However, in practical scenarios such as autonomous driving, there commonly exists …

Cited by 33 | Related articles | All 6 versions

[PDF] arxiv.org

Guess what moves: Unsupervised video and image segmentation by anticipating motion

S Choudhury, L Karazija, I Laina, A Vedaldi… - arXiv preprint arXiv …, 2022 - arxiv.org

Motion, measured via optical flow, provides a powerful cue to discover and learn objects in
images and videos. However, compared to using appearance, it has some blind spots, such …

Cited by 35 | Related articles | All 8 versions

[PDF] neurips.cc

Unsupervised multi-object segmentation by predicting probable motion patterns

L Karazija, S Choudhury, I Laina… - Advances in …, 2022 - proceedings.neurips.cc

We propose a new approach to learn to segment multiple image objects without manual
supervision. The method can extract objects from still images, but uses videos for …

Cited by 13 | Related articles | All 10 versions

[PDF] thecvf.com

Intrinsic physical concepts discovery with object-centric predictive models

Q Tang, X Zhu, Z Lei, Z Zhang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

The ability to discover abstract physical concepts and understand how they work in the world
through observing lies at the core of human intelligence. The acquisition of this ability is …

Cited by 8 | Related articles | All 6 versions

[PDF] arxiv.org

Compositional scene representation learning via reconstruction: A survey

J Yuan, T Chen, B Li, X Xue - IEEE Transactions on Pattern …, 2023 - ieeexplore.ieee.org

Visual scenes are composed of visual concepts and have the property of combinatorial
explosion. An important reason for humans to efficiently learn from diverse visual scenes is …

Cited by 25 | Related articles | All 6 versions