Selecting Effective and Discriminative Spatio-Temporal Interest Points for Recognizing Human Action

Hongbo ZHANG  Shaozi LI  Songzhi SU  Shu-Yuan CHEN  

IEICE TRANSACTIONS on Information and Systems   Vol.E96-D   No.8   pp.1783-1792
Publication Date: 2013/08/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.E96.D.1783
Print ISSN: 0916-8532
Type of Manuscript: PAPER
Category: Image Processing and Video Processing
human action recognition,  discriminative power,  ε-NN probability estimation,  class likelihood probability,  variance filter,  

Full Text: PDF>>
Buy this Article

Many successful methods for recognizing human action are spatio-temporal interest point (STIP) based methods. Given a test video sequence, for a matching-based method using a voting mechanism, each test STIP casts a vote for each action class based on its mutual information with respect to the respective class, which is measured in terms of class likelihood probability. Therefore, two issues should be addressed to improve the accuracy of action recognition. First, effective STIPs in the training set must be selected as references for accurately estimating probability. Second, discriminative STIPs in the test set must be selected for voting. This work uses ε-nearest neighbors as effective STIPs for estimating the class probability and uses a variance filter for selecting discriminative STIPs. Experimental results verify that the proposed method is more accurate than existing action recognition methods.