Dimension Reduction for Symbolic Regression

Authors

  • Paul Kahlmeyer Friedrich-Schiller Universität Jena
  • Markus Fischer Friedrich-Schiller Universität Jena
  • Joachim Giesen Friedrich-Schiller Universität Jena

DOI:

https://doi.org/10.1609/aaai.v39i17.33947

Abstract

Solutions of symbolic regression problems are expressions that are composed of input variables and operators from a finite set of function symbols. One measure for evaluating symbolic regression algorithms is their ability to recover formulae, up to symbolic equivalence, from finite samples. Not unexpectedly, the recovery problem becomes harder when the formula gets more complex, that is, when the number of variables and operators gets larger. Variables in naturally occurring symbolic formulas often appear only in fixed combinations. This can be exploited in symbolic regression by substituting one new variable for the combination, effectively reducing the number of variables. However, finding valid substitutions is challenging. Here, we address this challenge by searching over the expression space of small substitutions and testing for validity. The validity test is reduced to a test of functional dependence. The resulting iterative dimension reduction procedure can be used with any symbolic regression approach. We show that it reliably identifies valid substitutions and significantly boosts the performance of different types of state-of-the-art symbolic regression algorithms.

Downloads

Published

2025-04-11

How to Cite

Kahlmeyer, P., Fischer, M., & Giesen, J. (2025). Dimension Reduction for Symbolic Regression. Proceedings of the AAAI Conference on Artificial Intelligence, 39(17), 17707–17714. https://doi.org/10.1609/aaai.v39i17.33947

Issue

Section

AAAI Technical Track on Machine Learning III