PowerMLP: An Efficient Version of KAN

Ruichen Qiu; Yibo Miao; Shiwen Wang; Yifan Zhu; Lijia Yu; Xiao-Shan Gao

doi:10.1609/aaai.v39i19.34210

Authors

Ruichen Qiu: School of Advanced Interdisciplinary Sciences, UCAS, Beijing 100049, China; Academy of Mathematics and Systems Science, CAS, Beijing 100190, China
Yibo Miao: Academy of Mathematics and Systems Science, CAS, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 101408, China
Shiwen Wang: University of Chinese Academy of Sciences, Beijing 101408, China
Yifan Zhu: Academy of Mathematics and Systems Science, CAS, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 101408, China
Lijia Yu: Institute of Software, CAS, Beijing 100190, China; State Key Laboratory of Computer Science
Xiao-Shan Gao: Academy of Mathematics and Systems Science, CAS, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 101408, China

DOI:

https://doi.org/10.1609/aaai.v39i19.34210

Abstract

The Kolmogorov-Arnold Network (KAN) is a new network architecture known for its high accuracy in several tasks such as function fitting and PDE solving. The superior expressive capability of KAN arises from the Kolmogorov-Arnold representation theorem and learnable spline functions. However, computing spline functions involves multiple iterations, which renders KAN significantly slower than MLP and thereby increases the cost of model training and deployment. The authors of KAN also noted that "the biggest bottleneck of KANs lies in their slow training. KANs are usually 10x slower than MLPs, given the same number of parameters." To address this issue, we propose a novel MLP-type neural network, PowerMLP, that employs a simpler, non-iterative spline function representation, offering approximately the same training time as MLP while theoretically demonstrating stronger expressive power than KAN. Furthermore, we compare the FLOPs of KAN and PowerMLP, quantifying the faster computation speed of PowerMLP. Our comprehensive experiments demonstrate that PowerMLP generally achieves higher accuracy and trains about 40 times faster than KAN across various tasks.
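
To make the contrast between iterative and non-iterative spline evaluation concrete, the sketch below evaluates one B-spline basis function in two ways: with the Cox-de Boor recursion, and with a closed-form sum of truncated power functions, i.e. powers of ReLU. This is an illustration of a standard spline identity, not the PowerMLP implementation. The abstract does not specify the construction, so the uniform integer knots and the function names `relu_pow`, `bspline_recursive`, and `bspline_closed_form` are assumptions made for this example.

```python
from math import comb, factorial


def relu_pow(x, k):
    """k-th power of ReLU, i.e. the truncated power function (x)_+^k."""
    return x ** k if x > 0 else 0.0


def bspline_recursive(x, i, k):
    """Degree-k B-spline on integer knots i, i+1, ..., i+k+1 (Cox-de Boor).

    Each call recurses into two degree-(k-1) splines, so the work grows
    with the degree: this is the iterative evaluation.
    """
    if k == 0:
        return 1.0 if i <= x < i + 1 else 0.0
    left = (x - i) / k * bspline_recursive(x, i, k - 1)
    right = (i + k + 1 - x) / k * bspline_recursive(x, i + 1, k - 1)
    return left + right


def bspline_closed_form(x, i, k):
    """Same B-spline (k >= 1) as a single sum of shifted ReLU powers.

    Uses the identity for uniform knots:
        B(x) = 1/k! * sum_{j=0}^{k+1} (-1)^j * C(k+1, j) * (x - i - j)_+^k
    No recursion is needed, only k+2 shifted activations.
    """
    total = sum(
        (-1) ** j * comb(k + 1, j) * relu_pow(x - i - j, k)
        for j in range(k + 2)
    )
    return total / factorial(k)


if __name__ == "__main__":
    # Compare both evaluations of the cubic B-spline on a small grid.
    for step in range(0, 17):
        x = step * 0.25
        print(x, bspline_recursive(x, 0, 3), bspline_closed_form(x, 0, 3))
```

Both functions return the same values for degree 1 and above, but the closed form replaces the recursion with a fixed linear combination of shifted ReLU powers, which is the kind of operation an MLP layer computes directly.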
