AesFA: An Aesthetic Feature-Aware Arbitrary Neural Style Transfer
DOI:
https://doi.org/10.1609/aaai.v38i12.29232Keywords:
ML: Deep Generative Models & Autoencoders, CV: Applications, CV: Computational Photography, Image & Video Synthesis, GeneralAbstract
Neural style transfer (NST) has evolved significantly in recent years. Yet, despite its rapid progress and advancement, existing NST methods either struggle to transfer aesthetic information from a style effectively or suffer from high computational costs and inefficiencies in feature disentanglement due to using pre-trained models. This work proposes a lightweight but effective model, AesFA---Aesthetic Feature-Aware NST. The primary idea is to decompose the image via its frequencies to better disentangle aesthetic styles from the reference image while training the entire model in an end-to-end manner to exclude pre-trained models at inference completely. To improve the network's ability to extract more distinct representations and further enhance the stylization quality, this work introduces a new aesthetic feature: contrastive loss. Extensive experiments and ablations show the approach not only outperforms recent NST methods in terms of stylization quality, but it also achieves faster inference. Codes are available at https://github.com/Sooyyoungg/AesFA.Downloads
Published
2024-03-24
How to Cite
Kwon, J., Kim, S. ., Lin, Y., Yoo, S., & Cha, J. (2024). AesFA: An Aesthetic Feature-Aware Arbitrary Neural Style Transfer. Proceedings of the AAAI Conference on Artificial Intelligence, 38(12), 13310–13319. https://doi.org/10.1609/aaai.v38i12.29232
Issue
Section
AAAI Technical Track on Machine Learning III