Convolutional Spatio-Temporal Prediction Network with Trainable Positional Encoding

Yuan Xu, Yi Zhou Zhang, Da Zhi Sun, Kai Li, Qun Xiong Zhu, Wei Ke, Yang Zhang, Ming Qing Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Existing spatio-temporal prediction networks that rely on recurrent neural networks face significant parallelization challenges, leading to high computational costs and prolonged training durations. In contrast, recurrent-free methods, particularly convolutional neural networks (CNNs), have attracted significant attention because of their lightweight models and straightforward architectures. Traditional CNNs, however, are limited by their inherent weight-sharing property, which imposes a strong inductive bias and restricts their ability to capture spatial heterogeneity. To overcome these challenges, we propose a novel trainable positional encoding convolutional neural network (TPE-CNN). TPE-CNN incorporates a positional encoding module to dynamically learn location-specific spatio-temporal correlations and temporal evolution patterns, addressing the inability of CNNs to handle absolute positional dependencies effectively. Additionally, we integrate large-kernel convolutions and inception modules, significantly enhancing global information awareness. This design enables the incorporation of global attention mechanisms while maintaining a low parameter count, achieving an optimal balance between performance and computational efficiency. To validate the effectiveness of TPE-CNN, we performed extensive experiments on three real-world datasets. Experimental results demonstrate that TPE-CNN delivers superior prediction accuracy while maintaining low computational complexity.

Original languageEnglish
Title of host publicationProceedings of the 37th Chinese Control and Decision Conference, CCDC 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages406-411
Number of pages6
ISBN (Electronic)9798331510565
DOIs
Publication statusPublished - 2025
Event37th Chinese Control and Decision Conference, CCDC 2025 - Xiamen, China
Duration: 16 May 202519 May 2025

Publication series

NameProceedings of the 37th Chinese Control and Decision Conference, CCDC 2025

Conference

Conference37th Chinese Control and Decision Conference, CCDC 2025
Country/TerritoryChina
CityXiamen
Period16/05/2519/05/25

Keywords

  • Convolutional Neural Networks
  • Spatio-temporal Prediction
  • Trainable Positional Encoding

Fingerprint

Dive into the research topics of 'Convolutional Spatio-Temporal Prediction Network with Trainable Positional Encoding'. Together they form a unique fingerprint.

Cite this