PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography

Authors

  • Junle Liu South China University of Technology INTSIG-SCUT Joint Lab on Document Analysis and Recognition
  • Peirong Zhang South China University of Technology
  • Yuyi Zhang South China University of Technology INTSIG-SCUT Joint Lab on Document Analysis and Recognition
  • Pengyu Yan South China University of Technology
  • Hui Zhou Intsig Information Co. Ltd INTSIG-SCUT Joint Lab on Document Analysis and Recognition
  • Xinyue Zhou Intsig Information Co. Ltd INTSIG-SCUT Joint Lab on Document Analysis and Recognition
  • Fengjun Guo Intsig Information Co. Ltd INTSIG-SCUT Joint Lab on Document Analysis and Recognition
  • Lianwen Jin South China University of Technology INTSIG-SCUT Joint Lab on Document Analysis and Recognition

DOI:

https://doi.org/10.1609/aaai.v40i9.37656

Abstract

Commercial-grade poster design demands the seamless integration of aesthetic appeal with precise, informative content delivery. Current automated poster generation systems face significant limitations, including incomplete design workflows, poor text rendering accuracy, and insufficient flexibility for commercial applications. To address these challenges, we propose PosterVerse, a full-workflow, commercial-grade poster generation method that seamlessly automates the entire design process while delivering high-density and scalable text rendering. PosterVerse replicates professional design through three key stages: (1) blueprint creation using fine-tuned LLMs to extract key design elements from user requirements, (2) graphical background generation via customized diffusion models to create visually appealing imagery, and (3) unified layout-text rendering with an MLLM-powered HTML engine to guarantee high text accuracy and flexible customization. In addition, we introduce PosterDNA, a commercial-grade, HTML-based dataset tailored for training and validating poster design models. To the best of our knowledge, PosterDNA is the first Chinese poster generation dataset to introduce HTML typography files, enabling scalable text rendering and fundamentally solving the challenges of rendering small and high-density text. Experimental results demonstrate that PosterVerse consistently produces commercial-grade posters with appealing visuals, accurate text alignment, and customizable layouts, making it a promising solution for automating commercial poster design.

Published

2026-03-14

How to Cite

Liu, J., Zhang, P., Zhang, Y., Yan, P., Zhou, H., Zhou, X., … Jin, L. (2026). PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography. Proceedings of the AAAI Conference on Artificial Intelligence, 40(9), 7197–7205. https://doi.org/10.1609/aaai.v40i9.37656

Issue

Section

AAAI Technical Track on Computer Vision VI