作者
Matteo Bregonzio, Shaogang Gong, Tao Xiang
发表日期
2009/6/20
研讨会论文
2009 IEEE conference on computer vision and pattern recognition
页码范围
1948-1955
出版商
IEEE
简介
Much of recent action recognition research is based on space-time interest points extracted from video using a Bag of Words (BOW) representation. It mainly relies on the discriminative power of individual local space-time descriptors, whilst ignoring potentially valuable information about the global spatio-temporal distribution of interest points. In this paper, we propose a novel action recognition approach which differs significantly from previous interest points based approaches in that only the global spatiotemporal distribution of the interest points are exploited. This is achieved through extracting holistic features from clouds of interest points accumulated over multiple temporal scales followed by automatic feature selection. Our approach avoids the non-trivial problems of selecting the optimal space-time descriptor, clustering algorithm for constructing a codebook, and selecting codebook size faced by previous …
引用总数
200820092010201120122013201420152016201720182019202020212022202320242732425766605151332627201216173
学术搜索中的文章
M Bregonzio, S Gong, T Xiang - 2009 IEEE conference on computer vision and pattern …, 2009