Segmenting, modeling, and matching video clips containing multiple moving objects

J Sivic, A Zisserman - IEEE transactions on pattern analysis …, 2008 - ieeexplore.ieee.org

We describe an approach to object retrieval which searches for and localizes all the
occurrences of an object in a video, given a query image of the object. The object is …

Cited by 742 Related articles All 19 versions

3D object modeling and recognition using local affine-invariant image descriptors and multi-view spatial constraints

F Rothganger, S Lazebnik, C Schmid… - International journal of …, 2006 - Springer

This article introduces a novel representation for three-dimensional (3D) objects in terms of
local affine-invariant descriptors of their images and the spatial relationships between the …

Cited by 580 Related articles All 31 versions

A statistical approach to the matching of local features

J Rabin, J Delon, Y Gousseau - SIAM Journal on Imaging Sciences, 2009 - SIAM

This paper focuses on the matching of local features between images. Given a set of query
descriptors and a database of candidate descriptors, the goal is to decide which ones …

Cited by 119 Related articles All 7 versions

Content based video matching using spatiotemporal volumes

A Basharat, Y Zhai, M Shah - Computer Vision and Image Understanding, 2008 - Elsevier

This paper presents a novel framework for matching video sequences using the
spatiotemporal segmentation of videos. Instead of using appearance features for region …

Cited by 131 Related articles All 9 versions

Discrete approximations of Gaussian smoothing and Gaussian derivatives

T Lindeberg - Journal of Mathematical Imaging and Vision, 2024 - Springer

This paper develops an in-depth treatment concerning the problem of approximating the
Gaussian smoothing and the Gaussian derivative computations in scale-space theory for …

Cited by 7 Related articles All 6 versions

Efficient visual search for objects in videos

J Sivic, A Zisserman - Proceedings of the IEEE, 2008 - ieeexplore.ieee.org

We describe an approach to generalize the concept of text-based search to nontextual
information. In particular, we elaborate on the possibilities of retrieving objects or scenes in a …

Cited by 121 Related articles All 12 versions

Object level grouping for video shots

J Sivic, F Schaffalitzky, A Zisserman - International Journal of Computer …, 2006 - Springer

We describe a method for automatically obtaining object representations suitable for
retrieval from generic video shots. The object representation consists of an association of …

Cited by 118 Related articles All 10 versions

Real-time task recognition in cataract surgery videos using adaptive spatiotemporal polynomials

G Quellec, M Lamard, B Cochener… - IEEE transactions on …, 2014 - ieeexplore.ieee.org

This paper introduces a new algorithm for recognizing surgical tasks in real-time in a video
stream. The goal is to communicate information to the surgeon in due time during a video …

Cited by 55 Related articles All 7 versions

Learning visual flows: A Lie algebraic approach

D Lin, E Grimson, J Fisher - 2009 IEEE Conference on …, 2009 - ieeexplore.ieee.org

We present a novel method for modeling dynamic visual phenomena, which consists of two
key aspects. First, the integral motion of constituent elements in a dynamic scene is captured …

Cited by 89 Related articles All 12 versions

Motion segmentation of RGB-D sequences: Combining semantic and motion information using statistical inference

S Muthu, R Tennakoon, T Rathnayake… - … on Image Processing, 2020 - ieeexplore.ieee.org

This paper presents an innovative method for motion segmentation in RGB-D dynamic
videos with multiple moving objects. The focus is on finding static, small or slow moving …

Cited by 22 Related articles All 9 versions