Online Symbolic Regression with Informative Query

Authors

  • Pengwei Jin State Key Lab of Processors, Institute of Computing Technology, CAS University of Chinese Academy of Sciences Cambricon Technologies
  • Di Huang State Key Lab of Processors, Institute of Computing Technology, CAS University of Chinese Academy of Sciences Cambricon Technologies
  • Rui Zhang State Key Lab of Processors, Institute of Computing Technology, CAS Cambricon Technologies
  • Xing Hu State Key Lab of Processors, Institute of Computing Technology, CAS
  • Ziyuan Nan State Key Lab of Processors, Institute of Computing Technology, CAS University of Chinese Academy of Sciences Cambricon Technologies
  • Zidong Du State Key Lab of Processors, Institute of Computing Technology, CAS
  • Qi Guo State Key Lab of Processors, Institute of Computing Technology, CAS
  • Yunji Chen State Key Lab of Processors, Institute of Computing Technology, CAS University of Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v37i4.25641

Keywords:

APP: Other Applications, ML: Applications, ML: Representation Learning

Abstract

Symbolic regression, the task of extracting mathematical expressions from the observed data, plays a crucial role in scientific discovery. Despite the promising performance of existing methods, most of them conduct symbolic regression in an offline setting. That is, they treat the observed data points as given ones that are simply sampled from uniform distributions without exploring the expressive potential of data. However, for real-world scientific problems, the data used for symbolic regression are usually actively obtained by doing experiments, which is an online setting. Thus, how to obtain informative data that can facilitate the symbolic regression process is an important problem that remains challenging. In this paper, we propose QUOSR, a query-based framework for online symbolic regression that can automatically obtain informative data in an iterative manner. Specifically, at each step, QUOSR receives historical data points, generates new x, and then queries the symbolic expression to get the corresponding y, where the (x, y) serves as new data points. This process repeats until the maximum number of query steps is reached. To make the generated data points informative, we implement the framework with a neural network and train it by maximizing the mutual information between generated data points and the target expression. Through comprehensive experiments, we show that QUOSR can facilitate modern symbolic regression methods by generating informative data.

Downloads

Published

2023-06-26

How to Cite

Jin, P., Huang, D., Zhang, R., Hu, X., Nan, Z., Du, Z., Guo, Q., & Chen, Y. (2023). Online Symbolic Regression with Informative Query. Proceedings of the AAAI Conference on Artificial Intelligence, 37(4), 5122-5130. https://doi.org/10.1609/aaai.v37i4.25641

Issue

Section

AAAI Technical Track on Domain(s) of Application