作者
SM Ali Eslami, Nicolas Heess, Theophane Weber, Yuval Tassa, David Szepesvari, Geoffrey E Hinton
发表日期
2016
研讨会论文
Advances in Neural Information Processing Systems
页码范围
3225-3233
简介
We present a framework for efficient inference in structured image models that explicitly reason about objects. We achieve this by performing probabilistic inference using a recurrent neural network that attends to scene elements and processes them one at a time. Crucially, the model itself learns to choose the appropriate number of inference steps. We use this scheme to learn to perform inference in partially specified 2D models (variable-sized variational auto-encoders) and fully specified 3D models (probabilistic renderers). We show that such models learn to identify multiple objects-counting, locating and classifying the elements of a scene-without any supervision, eg, decomposing 3D images with various numbers of objects in a single forward pass of a neural network at unprecedented speed. We further show that the networks produce accurate inferences when compared to supervised counterparts, and that their structure leads to improved generalization.
引用总数
201620172018201920202021202220232024282853838590888141
学术搜索中的文章
SM Eslami, N Heess, T Weber, Y Tassa, D Szepesvari… - Advances in neural information processing systems, 2016