TY - GEN
T1 - A human activity recognition approach based on skeleton extraction and image reconstruction
AU - Chen, Yanbing
AU - Ke, Wei
AU - Chan, Ka Hou
AU - Xiong, Zhang
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/6/25
Y1 - 2021/6/25
N2 - Human Activity Recognition (HAR) has wide applications in surveillance, assisted living and robotic systems that interact with people. Traditional methods for HAR usually input source images directly. Various factors of these direct source images can affect the accuracy of recognition, such as heights, weights, poses, angles and whether being obscured or not. Most of the latest research on HAR has mainly focused on the use of 2D skeleton data, extracted from videos captured by standard cameras. Inspired by this trend, we propose a novel way of using skeleton data in this paper. As a pre-processing phase, we adopt the OpenPose library to extract 2D positions of human skeleton joints, then reconstruct, or synthesize, the skeleton image from these joints, together with coloring to encode the categories of different human parts. The reconstructed images are put into a Convolutional Neural Network (CNN) structure for classification, to obtain the recognition result. We implement the typical four activity classes, i.e., squat, stand, walk and work, in our experiments. Compared with feeding the original complex source images, our approach reduces the complexity of the neural network thus the time consumption, and improves the recognition accuracy significantly. All the images that we used for training and testing are collected from the real public places. We achieved a recognition accuracy of 97.3% in our experiments, indicating that our method has very good performance. Thanks to the lightning speed of the OpenPose library, our method can be carried out in real-time. The demonstration of using synthesized input to bridge existing technologies also has its general significance.
AB - Human Activity Recognition (HAR) has wide applications in surveillance, assisted living and robotic systems that interact with people. Traditional methods for HAR usually input source images directly. Various factors of these direct source images can affect the accuracy of recognition, such as heights, weights, poses, angles and whether being obscured or not. Most of the latest research on HAR has mainly focused on the use of 2D skeleton data, extracted from videos captured by standard cameras. Inspired by this trend, we propose a novel way of using skeleton data in this paper. As a pre-processing phase, we adopt the OpenPose library to extract 2D positions of human skeleton joints, then reconstruct, or synthesize, the skeleton image from these joints, together with coloring to encode the categories of different human parts. The reconstructed images are put into a Convolutional Neural Network (CNN) structure for classification, to obtain the recognition result. We implement the typical four activity classes, i.e., squat, stand, walk and work, in our experiments. Compared with feeding the original complex source images, our approach reduces the complexity of the neural network thus the time consumption, and improves the recognition accuracy significantly. All the images that we used for training and testing are collected from the real public places. We achieved a recognition accuracy of 97.3% in our experiments, indicating that our method has very good performance. Thanks to the lightning speed of the OpenPose library, our method can be carried out in real-time. The demonstration of using synthesized input to bridge existing technologies also has its general significance.
KW - Convolutional neural network
KW - Deep learning
KW - Human activity recognition
KW - OpenPose
UR - http://www.scopus.com/inward/record.url?scp=85118228998&partnerID=8YFLogxK
U2 - 10.1145/3474906.3474909
DO - 10.1145/3474906.3474909
M3 - Conference contribution
AN - SCOPUS:85118228998
T3 - ACM International Conference Proceeding Series
SP - 1
EP - 8
BT - ICGSP 2021 - 5th International Conference on Graphics and Signal Processing
PB - Association for Computing Machinery
T2 - 5th International Conference on Graphics and Signal Processing, ICGSP 2021
Y2 - 25 June 2021 through 27 June 2021
ER -