跳至主導覽 跳至搜尋 跳過主要內容

FOPS-V: Feature-Aware Optimization and Parallel Scale Fusion for 3D Human Reconstruction in Video

  • Yang Huang
  • , Guoheng Huang
  • , Lianglun Cheng
  • , Yejing Huo
  • , Xuhang Chen
  • , Xiaochen Yuan
  • , Guo Zhong
  • , Chi Man Pun

研究成果: Conference contribution同行評審

摘要

Video-based 3D human reconstruction, a fundamental task in computer vision, aims to accurately estimate the 3D pose and shape of the human body from video sequences. While recent methods leverage spatial and temporal feature extraction techniques, many remain limited by single-scale processing, hindering their performance in complex scenes. Additionally, challenges such as occlusion and complex poses often lead to inaccurate reconstructions. To address these limitations, we propose FOPS-V: Feature-aware Optimization and Parallel Scale Fusion for 3D Human Reconstruction in Video. Our approach comprises three key components: a Feature-Aware Optimization (FAO) block, a Parallel Scale-Aware Attention (PSAA) block, and a Normalized Feature-Aware Representation (NFAR) guided by Feature-Response Layer Normalization (FRLN). The FAO block enhances feature extraction by optimizing joint and mesh vertex representations through the fusion of image features and learned query vectors. The PSAA block performs subscale feature extraction for joint and mesh vertices and fuses multiscale feature information to improve pose and shape representations. Guided by FRLN, the NFAR addresses instability caused by variations in feature statistics within the FAO and PSAA blocks. This normalization, with an adaptable threshold, enhances robustness to noisy or outlier data, preventing performance degradation. Extensive evaluations on the 3DPW, MPI-INF-3DHP, and Human3.6M datasets demonstrate that FOPS-V outperforms state-of-the-art methods, highlighting its effectiveness for 3D human reconstruction in video.

原文English
主出版物標題Neural Information Processing - 31st International Conference, ICONIP 2024, Proceedings
編輯Mufti Mahmud, Maryam Doborjeh, Kevin Wong, Andrew Chi Sing Leung, Zohreh Doborjeh, M. Tanveer
發行者Springer Science and Business Media Deutschland GmbH
頁面180-194
頁數15
ISBN(列印)9789819665983
DOIs
出版狀態Published - 2025
事件31st International Conference on Neural Information Processing, ICONIP 2024 - Auckland, New Zealand
持續時間: 2 12月 20246 12月 2024

出版系列

名字Lecture Notes in Computer Science
15293 LNCS
ISSN(列印)0302-9743
ISSN(電子)1611-3349

Conference

Conference31st International Conference on Neural Information Processing, ICONIP 2024
國家/地區New Zealand
城市Auckland
期間2/12/246/12/24

指紋

深入研究「FOPS-V: Feature-Aware Optimization and Parallel Scale Fusion for 3D Human Reconstruction in Video」主題。共同形成了獨特的指紋。

引用此