HVAdam: A Full-Dimension Adaptive Optimizer

Authors

  • Yiheng Zhang Wuhan University
  • Shaowu Wu Wuhan University
  • Yuanzhuo Xu Wuhan University
  • Jiajun Wu University of Calgary
  • Shang Xu University College London
  • Steve Drew University of Calgary
  • Xiaoguang Niu Wuhan University

DOI:

https://doi.org/10.1609/aaai.v39i21.34421

Abstract

Adaptive optimizers such as Adam and RMSProp have gained attraction in complex neural networks, including generative adversarial networks (GANs) and Transformers, thanks to their stable performance and fast convergence compared to non-adaptive optimizers. A frequently overlooked limitation of adaptive optimizers is that adjusting the learning rate of each dimension individually would ignore the knowledge of the whole loss landscape, resulting in slow updates of parameters, invalidating the learning rate adjustment strategy and eventually leading to widespread insufficient convergence of parameters. In this paper, we propose HVAdam, a novel optimizer that associates all dimensions of the parameters to find a new parameter update direction, leading to a refined parameter update strategy for an increased convergence rate. We validated HVAdam in extensive experiments, showing its faster convergence, higher accuracy, and more stable performance on image classification, image generation, and natural language processing tasks. Particularly, HVAdam achieves a significant improvement on GANs compared with other state-of-the-art methods, especially in Wasserstein-GAN (WGAN) and its improved version with gradient penalty (WGAN-GP).

Downloads

Published

2025-04-11

How to Cite

Zhang, Y., Wu, S., Xu, Y., Wu, J., Xu, S., Drew, S., & Niu, X. (2025). HVAdam: A Full-Dimension Adaptive Optimizer. Proceedings of the AAAI Conference on Artificial Intelligence, 39(21), 22623–22631. https://doi.org/10.1609/aaai.v39i21.34421

Issue

Section

AAAI Technical Track on Machine Learning VII