HouseTune: Two-Stage Floorplan Generation with LLM Assistance
DOI:
https://doi.org/10.1609/aaai.v40i16.38417Abstract
This paper proposes a two-stage text-to-floorplan generation framework that combines the reasoning capability of Large Language Models (LLMs) with the generative power of diffusion models. In the first stage, we leverage a Chain-of-Thought (CoT) prompting strategy to guide an LLM in generating an initial layout, Layout-Init, from natural language descriptions, which ensures a user-friendly and intuitive design process. However, Layout-Init may lack precise geometric alignment and fine-grained structural details due to the inherent limitations of LLMs. To address this, in the second stage we propose a Dual-Noise Prior-Preserved Diffusion (DNPP-Diffusion) model to refine Layout-Init into a final floorplan that better adheres to physical constraints and user requirements. By combining LLMs and a dedicated refining model, our approach is able to generate high-quality floorplans without requiring large-scale domain-specific training data. Experimental results demonstrate its advantages in comparison with state of the art methods, and validate its effectiveness in home design applications.Downloads
Published
2026-03-14
How to Cite
Zong, Z., Chen, G., Zhan, Z., Yu, F., & Tan, G. (2026). HouseTune: Two-Stage Floorplan Generation with LLM Assistance. Proceedings of the AAAI Conference on Artificial Intelligence, 40(16), 14059–14067. https://doi.org/10.1609/aaai.v40i16.38417
Issue
Section
AAAI Technical Track on Computer Vision XIII