Script-Generated Picture Book Technology Based on Large Language Models and AIGC

Dejiang Wang, Zhuoran Zhai, Ngai Cheong, Li Peng

Research output: Conference contribution, peer-reviewed

Abstract

This paper discusses how to combine large language models, such as GPT and the Ernie model, with AIGC tools represented by Stable Diffusion to generate, from an arbitrary story script, images with a fixed style, consistent character characteristics, and continuous plots. The article gives a detailed introduction to building a pipeline that uses a large language model and a story script to generate the prompt words required by Stable Diffusion. Subsequently, by comparing the characteristics of traditional picture book production with the image results obtained from language-model-generated prompt words, it summarizes the limitations of text-to-image generation. This leads to a supervised, multi-round iterative LoRA model scheme that uses CLIP to achieve character IP fixation. In addition, using the ControlNet model and inpainting to preprocess and reprocess the images makes it possible to achieve controllable character poses and fixed backgrounds in the picture book. Finally, the new scheme is evaluated and summarized, and its strengths in picture book creation are analyzed.

Original language: English
Title of host publication: ICDTE 2023 - 2023 7th International Conference on Digital Technology in Education
Publisher: Association for Computing Machinery
Pages: 104-108
Number of pages: 5
ISBN (Electronic): 9798400708527
DOIs
Publication status: Published - 8 Sep 2023
Event: 7th International Conference on Digital Technology in Education, ICDTE 2023 - Virtual, Online, China
Duration: 8 Sep 2023 to 10 Sep 2023

Publication series

Name: ACM International Conference Proceeding Series

Conference

Conference: 7th International Conference on Digital Technology in Education, ICDTE 2023
Country/Territory: China
City: Virtual, Online
Period: 8/09/23 to 10/09/23
