StableYolo: Optimizing Image Generation for Large Language Models

SEARCH-BASED SOFTWARE ENGINEERING, SSBSE 2023(2024)

引用 0|浏览8
暂无评分
摘要
AI-based image generation is bounded by system parameters and the way users define prompts. Both prompt engineering and AI tuning configuration are current open research challenges and they require a significant amount of manual effort to generate good quality images. We tackle this problem by applying evolutionary computation to Stable Diffusion, tuning both prompts and model parameters simultaneously. We guide our search process by using Yolo. Our experiments show that our system, dubbed StableYolo, significantly improves image quality (52% on average compared to the baseline), helps identify relevant words for prompts, reduces the number of GPU inference steps per image (from 100 to 45 on average), and keeps the length of the prompt short (approximate to 7 keywords).
更多
查看译文
关键词
LLMS,SBSE,Image Generation,Stable Diffusion,Yolo
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要