跳至主導覽 跳至搜尋 跳過主要內容

Single Cross-domain Semantic Guidance Network for Multimodal Unsupervised Image Translation

  • Jiaying Lan
  • , Lianglun Cheng
  • , Guoheng Huang
  • , Chi Man Pun
  • , Xiaochen Yuan
  • , Shangyu Lai
  • , Hong Rui Liu
  • , Wing Kuen Ling
  • Guangdong University of Technology
  • University of Macau
  • University of Maryland, College Park
  • San Jose State University

研究成果: Conference contribution同行評審

1 引文 斯高帕斯(Scopus)

摘要

Multimodal image-to-image translation has received great attention due to its flexibility and practicality. The existing methods lack the generality of effective style representation, and cannot capture different levels of stylistic semantic information from cross-domain images. Besides, they ignore the parallelism for cross-domain image generation, and their generator can only be responsible for specific domains. To address these issues, we propose a novel Single Cross-domain Semantic Guidance Network (SCSG-Net) for coarse-to-fine semantically controllable multimodal image translation. Images from different domains are mapped to a unified visual semantic latent space by a dual sparse feature pyramid encoder, and then the generative module generates the result images by extracting semantic style representation from the input images in a self-supervised manner guided by adaptive discrimination. Especially, our SCSG-Net meets the needs of users in different styles as well as diverse scenarios. Extensive experiments on different benchmark datasets show that our method can outperform other state-of-the-art methods both quantitatively and qualitatively.

原文English
主出版物標題MultiMedia Modeling - 29th International Conference, MMM 2023, Proceedings
編輯Duc-Tien Dang-Nguyen, Cathal Gurrin, Alan F. Smeaton, Martha Larson, Stevan Rudinac, Minh-Son Dao, Christoph Trattner, Phoebe Chen
發行者Springer Science and Business Media Deutschland GmbH
頁面165-177
頁數13
ISBN(列印)9783031270765
DOIs
出版狀態Published - 2023
事件29th International Conference on MultiMedia Modeling, MMM 2023 - Bergen, Norway
持續時間: 9 1月 202312 1月 2023

出版系列

名字Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
13833 LNCS
ISSN(列印)0302-9743
ISSN(電子)1611-3349

Conference

Conference29th International Conference on MultiMedia Modeling, MMM 2023
國家/地區Norway
城市Bergen
期間9/01/2312/01/23

指紋

深入研究「Single Cross-domain Semantic Guidance Network for Multimodal Unsupervised Image Translation」主題。共同形成了獨特的指紋。

引用此