TAGE: Trustworthy Attribute Group Editing for Stable Few-shot Image Generation

Ruicheng Zhang, Guoheng Huang, Yejing Huo, Xiaochen Yuan, Zhizhen Zhou, Shiting Wu, Guo Zhong

研究成果: Conference contribution同行評審

摘要

Generative Adversarial Networks (GANs) have emerged as a prominent research focus for image editing tasks, leveraging the powerful image generation capabilities of the GAN framework to produce remarkable results. However, prevailing approaches are contingent upon extensive training datasets and explicit supervision, presenting a significant challenge in manipulating the diverse attributes of new image classes with limited sample availability. To surmount this hurdle, we introduce TAGE, an innovative image generation network comprising three integral modules: the Codebook Learning Module (CLM), the Code Prediction Module (CPM) and the Prompt-driven Semantic Module (PSM). The CPM module delves into the semantic dimensions of category-agnostic attributes, encapsulating them within a discrete codebook. This module is predicated on the concept that images are assemblages of attributes, and thus, by editing these category-independent attributes, it is theoretically possible to generate images from unseen categories. Subsequently, the CPM module facilitates naturalistic image editing by predicting indices of category-independent attribute vectors within the codebook. Additionally, the PSM module generates semantic cues that are seamlessly integrated into the Transformer architecture of the CPM, enhancing the model’s comprehension of the targeted attributes for editing. With these semantic cues, the model can generate images that accentuate desired attributes more prominently while maintaining the integrity of the original category, even with a limited number of samples. We have conducted extensive experiments utilizing the Animal Faces, Flowers, and VGGFaces datasets. The results of these experiments demonstrate that our proposed method not only achieves superior performance but also exhibits a high degree of stability when compared to other few-shot image generation techniques.

原文English
主出版物標題Sixteenth International Conference on Signal Processing Systems, ICSPS 2024
編輯Robert Minasian, Li Chai
發行者SPIE
ISBN(電子)9781510689251
DOIs
出版狀態Published - 2025
事件16th International Conference on Signal Processing Systems, ICSPS 2024 - Kunming, China
持續時間: 15 11月 202417 11月 2024

出版系列

名字Proceedings of SPIE - The International Society for Optical Engineering
13559
ISSN(列印)0277-786X
ISSN(電子)1996-756X

Conference

Conference16th International Conference on Signal Processing Systems, ICSPS 2024
國家/地區China
城市Kunming
期間15/11/2417/11/24

指紋

深入研究「TAGE: Trustworthy Attribute Group Editing for Stable Few-shot Image Generation」主題。共同形成了獨特的指紋。

引用此