Image classification with the Fisher vector: Theory and practice

Authors

Jorge Sánchez, Florent Perronnin, Thomas Mensink, Jakob Verbeek

Publication date

2013/12/1

Journal

International Journal of Computer Vision (IJCV)

Volume

105

Issue

Pages

222-245

Description

A standard approach to describe an image for classification and retrieval purposes is to extract a set of local patch descriptors, encode them into a high-dimensional vector and pool them into an image-level signature. The most common patch encoding strategy consists in quantizing the local descriptors into a finite set of prototypical elements. This leads to the popular Bag-of-Visual words representation. In this work, we propose to use the Fisher Kernel framework as an alternative patch encoding strategy: we describe patches by their deviation from a “universal” generative Gaussian mixture model. This representation, which we call the Fisher vector, has many advantages: it is efficient to compute, it leads to excellent results even with efficient linear classifiers, and it can be compressed with a minimal loss of accuracy using product quantization. We report experimental results on five standard datasets …
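The encoding summarized above (patches described by their deviation from a Gaussian mixture model, then pooled into one image-level vector) can be illustrated with a minimal NumPy sketch. It assumes a diagonal-covariance GMM and uses the commonly cited form of the Fisher vector: gradients with respect to the means and standard deviations, followed by power and L2 normalization. The function name `fisher_vector`, the array shapes, and the random GMM parameters are illustrative assumptions, not the authors' code; in practice the GMM would be trained on real local descriptors.

```python
import numpy as np


def fisher_vector(X, weights, means, sigmas):
    """Encode local descriptors X (T, D) against a diagonal GMM with K components.

    weights: (K,) mixture weights, means: (K, D), sigmas: (K, D) standard deviations.
    Returns a vector of length 2 * K * D.
    """
    T, D = X.shape
    # Deviation of every descriptor from every component, scaled by sigma: (T, K, D)
    diff = (X[:, None, :] - means[None, :, :]) / sigmas[None, :, :]
    # Log of (weight * Gaussian density) for each descriptor and component: (T, K)
    log_p = (np.log(weights)[None, :]
             - 0.5 * np.sum(diff ** 2, axis=2)
             - np.sum(np.log(sigmas), axis=1)[None, :]
             - 0.5 * D * np.log(2.0 * np.pi))
    # Soft assignments (posteriors), normalized stably in the log domain
    log_p -= log_p.max(axis=1, keepdims=True)
    gamma = np.exp(log_p)
    gamma /= gamma.sum(axis=1, keepdims=True)
    # Average gradients with respect to the means and the standard deviations
    g_mu = np.einsum("tk,tkd->kd", gamma, diff) / (T * np.sqrt(weights)[:, None])
    g_sigma = (np.einsum("tk,tkd->kd", gamma, diff ** 2 - 1.0)
               / (T * np.sqrt(2.0 * weights)[:, None]))
    fv = np.concatenate([g_mu.ravel(), g_sigma.ravel()])
    # Power normalization (signed square root), then L2 normalization
    fv = np.sign(fv) * np.sqrt(np.abs(fv))
    norm = np.linalg.norm(fv)
    return fv / norm if norm > 0 else fv


# Toy usage with placeholder data: 200 descriptors of dimension 8, K = 4 components.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 8))
K = 4
weights = np.full(K, 1.0 / K)      # uniform mixture weights (assumption)
means = rng.normal(size=(K, 8))    # random means standing in for a trained GMM
sigmas = np.ones((K, 8))           # unit standard deviations (assumption)
fv = fisher_vector(X, weights, means, sigmas)
print(fv.shape)
```

The resulting vector has 2 × K × D entries, which is why the representation is high-dimensional even for a small mixture; a linear classifier can then be trained directly on these vectors.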

Total citations

Cited by 1913

Citations per year: 2013: 21, 2014: 105, 2015: 201, 2016: 285, 2017: 280, 2018: 223, 2019: 215, 2020: 179, 2021: 126, 2022: 123, 2023: 85, 2024: 28
