Efficient visual search of videos cast as text retrieval

J Sivic, A Zisserman - IEEE transactions on pattern analysis …, 2008 - ieeexplore.ieee.org
We describe an approach to object retrieval which searches for and localizes all the
occurrences of an object in a video, given a query image of the object. The object is …

3D object modeling and recognition using local affine-invariant image descriptors and multi-view spatial constraints

F Rothganger, S Lazebnik, C Schmid… - International journal of …, 2006 - Springer
This article introduces a novel representation for three-dimensional (3D) objects in terms of
local affine-invariant descriptors of their images and the spatial relationships between the …

A statistical approach to the matching of local features

J Rabin, J Delon, Y Gousseau - SIAM Journal on Imaging Sciences, 2009 - SIAM
This paper focuses on the matching of local features between images. Given a set of query
descriptors and a database of candidate descriptors, the goal is to decide which ones …

Content based video matching using spatiotemporal volumes

A Basharat, Y Zhai, M Shah - Computer Vision and Image Understanding, 2008 - Elsevier
This paper presents a novel framework for matching video sequences using the
spatiotemporal segmentation of videos. Instead of using appearance features for region …

Discrete approximations of Gaussian smoothing and Gaussian derivatives

T Lindeberg - Journal of Mathematical Imaging and Vision, 2024 - Springer
This paper develops an in-depth treatment concerning the problem of approximating the
Gaussian smoothing and the Gaussian derivative computations in scale-space theory for …

Efficient visual search for objects in videos

J Sivic, A Zisserman - Proceedings of the IEEE, 2008 - ieeexplore.ieee.org
We describe an approach to generalize the concept of text-based search to nontextual
information. In particular, we elaborate on the possibilities of retrieving objects or scenes in a …

Object level grouping for video shots

J Sivic, F Schaffalitzky, A Zisserman - International Journal of Computer …, 2006 - Springer
We describe a method for automatically obtaining object representations suitable for
retrieval from generic video shots. The object representation consists of an association of …

Real-time task recognition in cataract surgery videos using adaptive spatiotemporal polynomials

G Quellec, M Lamard, B Cochener… - IEEE transactions on …, 2014 - ieeexplore.ieee.org
This paper introduces a new algorithm for recognizing surgical tasks in real-time in a video
stream. The goal is to communicate information to the surgeon in due time during a video …

Learning visual flows: A Lie algebraic approach

D Lin, E Grimson, J Fisher - 2009 IEEE Conference on …, 2009 - ieeexplore.ieee.org
We present a novel method for modeling dynamic visual phenomena, which consists of two
key aspects. First, the integral motion of constituent elements in a dynamic scene is captured …

Motion segmentation of RGB-D sequences: Combining semantic and motion information using statistical inference

S Muthu, R Tennakoon, T Rathnayake… - … on Image Processing, 2020 - ieeexplore.ieee.org
This paper presents an innovative method for motion segmentation in RGB-D dynamic
videos with multiple moving objects. The focus is on finding static, small or slow moving …