GMAMflow: Learning Optical Flow through Global Multi-scale Attention Matching

Jiaxuan Lin, Qiyuan Zhang, Xiaochen Yuan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Optical flow estimation is essential for various computer vision applications, including autonomous vehicles, object tracking, and frame interpolation. However, enhancing the accuracy of optical flow algorithms while optimizing their structural complexity continues to be a formidable challenge. This paper presents GMAMflow, a groundbreaking optical flow estimation model that leverages a global multi-scale attention mechanism. Our enhancement of the existing EMA allows the network to more effectively capture intricate details across diverse scales within complex imagery. The architectural refinements to the GMflow framework are specifically designed to elevate the learning process of optical flow. Comprehensive experimental validation using the Sintel dataset demonstrates that our approach achieves higher accuracy and outperforms competing methods. These results signify a significant stride in optical flow research, offering not only theoretical contributions but also practical implications for the advancement of technology in associated domains.

Original languageEnglish
Title of host publicationSixteenth International Conference on Signal Processing Systems, ICSPS 2024
EditorsRobert Minasian, Li Chai
PublisherSPIE
ISBN (Electronic)9781510689251
DOIs
Publication statusPublished - 2025
Event16th International Conference on Signal Processing Systems, ICSPS 2024 - Kunming, China
Duration: 15 Nov 202417 Nov 2024

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume13559
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

Conference16th International Conference on Signal Processing Systems, ICSPS 2024
Country/TerritoryChina
CityKunming
Period15/11/2417/11/24

Keywords

  • attention mechanism
  • deep learning
  • motion estimation
  • neural network
  • Optical flow

Fingerprint

Dive into the research topics of 'GMAMflow: Learning Optical Flow through Global Multi-scale Attention Matching'. Together they form a unique fingerprint.

Cite this