Learning Chemical Rules of Retrosynthesis with Pre-training

Authors

  • Yinjie Jiang Zhejiang University
  • Ying WEI City University of Hong Kong
  • Fei Wu Zhejiang University Shanghai Institute for Advanced Study of Zhejiang University
  • Zhengxing Huang Zhejiang University
  • Kun Kuang Zhejiang University
  • Zhihua Wang Shanghai Institute for Advanced Study of Zhejiang University

DOI:

https://doi.org/10.1609/aaai.v37i4.25640

Keywords:

APP: Healthcare, Medicine & Wellness

Abstract

Retrosynthesis aided by artificial intelligence has been a very active and bourgeoning area of research, for its critical role in drug discovery as well as material science. Three categories of solutions, i.e., template-based, template-free, and semi-template methods, constitute mainstream solutions to this problem. In this paper, we focus on template-free methods which are known to be less bothered by the template generalization issue and the atom mapping challenge. Among several remaining problems regarding template-free methods, failing to conform to chemical rules is pronounced. To address the issue, we seek for a pre-training solution to empower the pre-trained model with chemical rules encoded. Concretely, we enforce the atom conservation rule via a molecule reconstruction pre-training task, and the reaction rule that dictates reaction centers via a reaction type guided contrastive pre-training task. In our empirical evaluation, the proposed pre-training solution substantially improves the single-step retrosynthesis accuracies in three downstream datasets.

Downloads

Published

2023-06-26

How to Cite

Jiang, Y., WEI, Y., Wu, F., Huang, Z., Kuang, K., & Wang, Z. (2023). Learning Chemical Rules of Retrosynthesis with Pre-training. Proceedings of the AAAI Conference on Artificial Intelligence, 37(4), 5113-5121. https://doi.org/10.1609/aaai.v37i4.25640

Issue

Section

AAAI Technical Track on Domain(s) of Application