Efficient visual search of videos cast as text retrieval
J Sivic, A Zisserman - IEEE transactions on pattern analysis …, 2008 - ieeexplore.ieee.org
We describe an approach to object retrieval which searches for and localizes all the
occurrences of an object in a video, given a query image of the object. The object is …
3D object modeling and recognition using local affine-invariant image descriptors and multi-view spatial constraints
This article introduces a novel representation for three-dimensional (3D) objects in terms of
local affine-invariant descriptors of their images and the spatial relationships between the …
A statistical approach to the matching of local features
This paper focuses on the matching of local features between images. Given a set of query
descriptors and a database of candidate descriptors, the goal is to decide which ones …
Content based video matching using spatiotemporal volumes
This paper presents a novel framework for matching video sequences using the
spatiotemporal segmentation of videos. Instead of using appearance features for region …
Discrete approximations of Gaussian smoothing and Gaussian derivatives
T Lindeberg - Journal of Mathematical Imaging and Vision, 2024 - Springer
This paper develops an in-depth treatment concerning the problem of approximating the
Gaussian smoothing and the Gaussian derivative computations in scale-space theory for …
Efficient visual search for objects in videos
J Sivic, A Zisserman - Proceedings of the IEEE, 2008 - ieeexplore.ieee.org
We describe an approach to generalize the concept of text-based search to nontextual
information. In particular, we elaborate on the possibilities of retrieving objects or scenes in a …
Object level grouping for video shots
We describe a method for automatically obtaining object representations suitable for
retrieval from generic video shots. The object representation consists of an association of …
Real-time task recognition in cataract surgery videos using adaptive spatiotemporal polynomials
G Quellec, M Lamard, B Cochener… - IEEE transactions on …, 2014 - ieeexplore.ieee.org
This paper introduces a new algorithm for recognizing surgical tasks in real-time in a video
stream. The goal is to communicate information to the surgeon in due time during a video …
Learning visual flows: A Lie algebraic approach
We present a novel method for modeling dynamic visual phenomena, which consists of two
key aspects. First, the integral motion of constituent elements in a dynamic scene is captured …
Motion segmentation of RGB-D sequences: Combining semantic and motion information using statistical inference
This paper presents an innovative method for motion segmentation in RGB-D dynamic
videos with multiple moving objects. The focus is on finding static, small or slow moving …