Multi-channels CNN temporal features for depth-based action recognition

Jacek Trelinski; Bogdan Kwolek

doi:10.1117/12.2559432

31 January 2020 Multi-channels CNN temporal features for depth-based action recognition

Jacek Trelinski, Bogdan Kwolek

Proceedings Volume 11433, Twelfth International Conference on Machine Vision (ICMV 2019); 114330U (2020) https://doi.org/10.1117/12.2559432
Event: Twelfth International Conference on Machine Vision, 2019, Amsterdam, Netherlands

Abstract

In this paper, we investigate temporal features that are extracted by a multi-channel convolutional neural network in depth map-based human action recognition. At the beginning, for the non-zero pixels representing the person shape in each depth map we calculate handcrafted features. On multivariate time-series of such handcrafted features we train a multi-class, multi-channel CNN to model temporal features as well as we extract statistical features of time-series. The concatenated features are stored in a common feature vector. Afterwards, for each class we train a separate one-against-all convolutional neural network to extract class-specific features of depth maps. For each class-specific, multivariate time-series we calculate statistical features of time-series. Finally, each class-specific feature vector is concatenated with the common feature vector resulting in an action feature vector. For each action represented by action feature vectors we train a multi-class classifier with one-hot encoding of output labels. The recognition of the action is done by a voting-based ensemble operating on such one-hot encodings. We demonstrate experimentally that on UTD-MHAD dataset the proposed algorithm outperforms state-of-the-art depth-based algorithms and attains promising results on MSR-Action3D dataset.

Citation Download Citation

Jacek Trelinski and Bogdan Kwolek "Multi-channels CNN temporal features for depth-based action recognition", Proc. SPIE 11433, Twelfth International Conference on Machine Vision (ICMV 2019), 114330U (31 January 2020); https://doi.org/10.1117/12.2559432

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available