Variational inference: A review for statisticians

DM Blei, A Kucukelbir, JD McAuliffe - Journal of the American …, 2017 - Taylor & Francis
One of the core problems of modern statistics is to approximate difficult-to-compute
probability densities. This problem is especially important in Bayesian statistics, which …

Self-supervised video object segmentation by motion grouping

C Yang, H Lamdouar, E Lu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Animals have evolved highly functional visual systems to understand motion, assisting
perception even under complex environments. In this paper, we work towards developing a …

Video segmentation via object flow

YH Tsai, MH Yang, MJ Black - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
Video object segmentation is challenging due to fast moving objects, deforming shapes, and
cluttered backgrounds. Optical flow can be used to propagate an object segmentation over …

Weakly-supervised action localization with background modeling

PX Nguyen, D Ramanan… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
We describe a latent approach that learns to detect actions in long sequences given training
videos with only whole-video class labels. Our approach makes use of two innovations to …

Video (language) modeling: a baseline for generative models of natural videos

MA Ranzato, A Szlam, J Bruna, M Mathieu… - arXiv preprint arXiv …, 2014 - arxiv.org
We propose a strong baseline model for unsupervised feature learning using video data. By
learning to predict missing frames or extrapolate future frames from an input video …

Neural expectation maximization

K Greff, S Van Steenkiste… - Advances in Neural …, 2017 - proceedings.neurips.cc
Many real world tasks such as reasoning and physical interaction require identification and
manipulation of conceptual entities. A first step towards solving these tasks is the automated …

[图书][B] Computer vision: algorithms and applications

R Szeliski - 2022 - books.google.com
Humans perceive the three-dimensional structure of the world with apparent ease. However,
despite all of the recent advances in computer vision research, the dream of having a …

Deformable sprites for unsupervised video decomposition

V Ye, Z Li, R Tucker, A Kanazawa… - Proceedings of the …, 2022 - openaccess.thecvf.com
We describe a method to extract persistent elements of a dynamic scene from an input
video. We represent each scene element as a Deformable Sprite consisting of three …

Beyond pixels: exploring new representations and applications for motion analysis

C Liu - 2009 - dspace.mit.edu
The focus of motion analysis has been on estimating a flow vector for every pixel by
matching intensities. In my thesis, I will explore motion representations beyond the pixel …

A computational approach for obstruction-free photography

T Xue, M Rubinstein, C Liu, WT Freeman - ACM Transactions on …, 2015 - dl.acm.org
We present a unified computational approach for taking photos through reflecting or
occluding elements such as windows and fences. Rather than capturing a single image, we …