TY - JOUR
T1 - QIIS
T2 - Quaternion Inter- and Intra-Task Interaction Strategy for Multi-Task Dense Prediction
AU - Huang, Kai
AU - Huang, Guoheng
AU - Li, Yan
AU - Li, Ming
AU - Ling, Wing Kuen
AU - Yuan, Xiaochen
AU - Pun, Chi Man
AU - Cheng, Lianglun
N1 - Publisher Copyright:
© 1975-2011 IEEE.
PY - 2025
Y1 - 2025
N2 - In the domain of intelligent consumer electronics, edge devices such as monocular cameras increasingly demand the ability to perform multiple visual perception tasks simultaneously, including semantic segmentation, depth estimation, and boundary detection. However, most existing multi-task dense prediction frameworks suffer from inefficient task interaction and limited feature integration, which significantly constrain their applicability in resource-constrained consumer environments. To address these limitations, we propose a Quaternion Interand Intra-task Interaction Strategy (QIIS), a unified framework designed to enhance task collaboration through quaternion-based feature fusion. Specifically, we introduce a Dual-stream Patch Embedding (DPE) module to extract rich local features, a Crossprompt Quaternion Interaction (CQI) block to achieve efficient inter-task communication, and an Intra-Task Quaternion Fusion (ITQF) module that enables effective fusion of task-generic, taskspecific, and cross-task representations. By leveraging the Hamilton product inherent in quaternion convolution, our approach enhances representational capacity while reducing parameter complexity. Experimental results on the PASCAL-Context and NYUD-v2 benchmarks demonstrate that QIIS achieves competitive performance across multiple tasks, with parameter and FLOPs counts that are generally lower than those of state-ofthe-art methods, making it a viable solution for vision-based measurement and perception in embedded consumer systems.
AB - In the domain of intelligent consumer electronics, edge devices such as monocular cameras increasingly demand the ability to perform multiple visual perception tasks simultaneously, including semantic segmentation, depth estimation, and boundary detection. However, most existing multi-task dense prediction frameworks suffer from inefficient task interaction and limited feature integration, which significantly constrain their applicability in resource-constrained consumer environments. To address these limitations, we propose a Quaternion Interand Intra-task Interaction Strategy (QIIS), a unified framework designed to enhance task collaboration through quaternion-based feature fusion. Specifically, we introduce a Dual-stream Patch Embedding (DPE) module to extract rich local features, a Crossprompt Quaternion Interaction (CQI) block to achieve efficient inter-task communication, and an Intra-Task Quaternion Fusion (ITQF) module that enables effective fusion of task-generic, taskspecific, and cross-task representations. By leveraging the Hamilton product inherent in quaternion convolution, our approach enhances representational capacity while reducing parameter complexity. Experimental results on the PASCAL-Context and NYUD-v2 benchmarks demonstrate that QIIS achieves competitive performance across multiple tasks, with parameter and FLOPs counts that are generally lower than those of state-ofthe-art methods, making it a viable solution for vision-based measurement and perception in embedded consumer systems.
KW - Feature Fusion
KW - Multi-task learning
KW - Quaternion Neural Networks
KW - Transformer
UR - https://www.scopus.com/pages/publications/105024082453
U2 - 10.1109/TCE.2025.3641026
DO - 10.1109/TCE.2025.3641026
M3 - Article
AN - SCOPUS:105024082453
SN - 0098-3063
JO - IEEE Transactions on Consumer Electronics
JF - IEEE Transactions on Consumer Electronics
ER -