TY - GEN
T1 - 3D-ConvLSTMNet
T2 - 23rd IEEE International Conference on Mobile Data Management, MDM 2022
AU - He, Lihua
AU - Luo, Wuman
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Spatiotemporal correlations are crucial for traffic flow prediction. So far, various traffic flow prediction methods based on convolutional neural network (CNN) and long short-term memory (LSTM) network have been proposed. However, the common CNN - based models cannot preserve the temporal information after the first layer. Although the 3D CNN-based models can effectively capture short-term spatial and tempo-ral features, they are not suitable for long-term information capturing. LSTM is excellent at long-term features extraction. However, it alone cannot be used for spatial information extraction. To address these issues, we propose a deep architecture called 3D-ConvLSTMNet to better capture the spatiotemporal correlations among the traffic data. Specifically, we proposed a short-long term spatiotemporal feature extraction module called 3D-ConvLSTM, which uses 3D CNN to extract short-term spatiotemporal correlations, and uses ConvLSTM to extract the long-term spatiotemporal correlations. To get the long-distance spatial features, we adopt the residual neural network to develop the depth of 3D-ConvLSTMNet. Finally, we utilize a channel-wise attention mechanism to quantify the contribution of each grid in space domain. To evaluate the performances of ConvLSTMNet, we conduct extensive experiments on two real-world datasets. The experiment results show that our model gets better performances than the other state-of-the-art methods.
AB - Spatiotemporal correlations are crucial for traffic flow prediction. So far, various traffic flow prediction methods based on convolutional neural network (CNN) and long short-term memory (LSTM) network have been proposed. However, the common CNN - based models cannot preserve the temporal information after the first layer. Although the 3D CNN-based models can effectively capture short-term spatial and tempo-ral features, they are not suitable for long-term information capturing. LSTM is excellent at long-term features extraction. However, it alone cannot be used for spatial information extraction. To address these issues, we propose a deep architecture called 3D-ConvLSTMNet to better capture the spatiotemporal correlations among the traffic data. Specifically, we proposed a short-long term spatiotemporal feature extraction module called 3D-ConvLSTM, which uses 3D CNN to extract short-term spatiotemporal correlations, and uses ConvLSTM to extract the long-term spatiotemporal correlations. To get the long-distance spatial features, we adopt the residual neural network to develop the depth of 3D-ConvLSTMNet. Finally, we utilize a channel-wise attention mechanism to quantify the contribution of each grid in space domain. To evaluate the performances of ConvLSTMNet, we conduct extensive experiments on two real-world datasets. The experiment results show that our model gets better performances than the other state-of-the-art methods.
KW - 3D CNN
KW - ConvLSTM
KW - Traffic flow prediction
KW - channel-wise attention
KW - residual neural network
UR - http://www.scopus.com/inward/record.url?scp=85137580778&partnerID=8YFLogxK
U2 - 10.1109/MDM55031.2022.00041
DO - 10.1109/MDM55031.2022.00041
M3 - Conference contribution
AN - SCOPUS:85137580778
T3 - Proceedings - IEEE International Conference on Mobile Data Management
SP - 147
EP - 152
BT - Proceedings - 2022 23rd IEEE International Conference on Mobile Data Management, MDM 2022
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 6 June 2022 through 9 June 2022
ER -