Collective Activity Recognition by Attribute-Based Spatio-Temporal Descriptor

Changhong CHEN  Hehe DOU  Zongliang GAN  

IEICE TRANSACTIONS on Information and Systems   Vol.E98-D    No.10    pp.1875-1878
Publication Date: 2015/10/01
Publicized: 2015/07/22
Online ISSN: 1745-1361
DOI: 10.1587/transinf.2015EDL8108
Type of Manuscript: LETTER
Category: Pattern Recognition
collective activity recognition,  spatio-temporal descriptor,  attribute-based spatio-temporal descriptor,  

Full Text: PDF>>
Buy this Article

Collective activity recognition plays an important role in high-level video analysis. Most current feature representations look at contextual information extracted from the behaviour of nearby people. Every person needs to be detected and his pose should be estimated. After extracting the feature, hierarchical graphical models are always employed to model the spatio-temporal patterns of individuals and their interactions, and so can not avoid complex preprocessing and inference operations. To overcome these drawbacks, we present a new feature representation method, called attribute-based spatio-temporal (AST) descriptor. First, two types of information, spatio-temporal (ST) features and attribute features, are exploited. Attribute-based features are manually specified. An attribute classifier is trained to model the relationship between the ST features and attribute-based features, according to which the attribute features are refreshed. Then, the ST features, attribute features and the relationship between the attributes are combined to form the AST descriptor. An objective classifier can be specified on the AST descriptor and the weight parameters of the classifier are used for recognition. Experiments on standard collective activity benchmark sets show the effectiveness of the proposed descriptor.