Moonshine: Distilling Game Content Generators into Steerable Generative Models

Authors

  • Yuhe Nie New York University
  • Michael Middleton New York University
  • Tim Merino New York University
  • Nidhushan Kanagaraja New York University
  • Ashutosh Kumar New York University
  • Zhan Zhuang City University of Hong Kong Southern University of Science and Technology
  • Julian Togelius New York University

DOI:

https://doi.org/10.1609/aaai.v39i13.33571

Abstract

Procedural Content Generation via Machine Learning (PCGML) has enhanced game content creation, yet challenges in controllability and limited training data persist. This study addresses these issues by distilling a constructive PCG algorithm into a controllable PCGML model. We first generate a large amount of content with a constructive algorithm and label it using a Large Language Model (LLM). We use these synthetic labels to condition two PCGML models for content-specific generation, a diffusion model and the five-dollar model. This neural network distillation process ensures that the generation aligns with the original algorithm while introducing controllability through plain text. We define this text-conditioned PCGML as a Text-to-game-Map (T2M) task, offering an alternative to prevalent text-to-image multi-modal tasks. We compare our distilled models with the baseline constructive algorithm. Our analysis of the variety, accuracy, and quality of our generation demonstrates the efficacy of distilling constructive methods into controllable text-conditioned PCGML models.

Published

2025-04-11

How to Cite

Nie, Y., Middleton, M., Merino, T., Kanagaraja, N., Kumar, A., Zhuang, Z., & Togelius, J. (2025). Moonshine: Distilling Game Content Generators into Steerable Generative Models. Proceedings of the AAAI Conference on Artificial Intelligence, 39(13), 14344–14351. https://doi.org/10.1609/aaai.v39i13.33571

Issue

Section

AAAI Technical Track on Humans and AI