MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization

Authors

  • Tianfan Fu College of Computing, Georgia Institute of Technology
  • Cao Xiao Analytics Center of Excellence, IQVIA
  • Xinhao Li Department of Chemistry, North Carolina State University
  • Lucas M. Glass Analytics Center of Excellence, IQVIA Department of Statistics, Temple University
  • Jimeng Sun Department of Computer Science, University of Illinois, Urbana-Champaign

DOI:

https://doi.org/10.1609/aaai.v35i1.16085

Keywords:

Healthcare, Medicine & Wellness

Abstract

Molecule optimization is a fundamental task for accelerating drug discovery, with the goal of generating new valid molecules that maximize multiple drug properties while maintaining similarity to the input molecule. Existing generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties. To address such challenges, we propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution. MIMOSA first pretrains two property agnostic graph neural networks (GNNs) for molecule topology and substructure-type prediction, where a substructure can be either atom or single ring. For each iteration, MIMOSA uses the GNNs’ prediction and employs three basic substructure operations (add, replace, delete) to generate new molecules and associated weights. The weights can encode multiple constraints including similarity and drug property constraints, upon which we select promising molecules for next iteration. MIMOSA enables flexible encoding of multiple property- and similarity-constraints and can efficiently generate new molecules that satisfy various property constraints and achieved up to 49.1% relative improvement over the best baseline in terms of success rate.

Downloads

Published

2021-05-18

How to Cite

Fu, T., Xiao, C., Li, X., Glass, L. M., & Sun, J. (2021). MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization. Proceedings of the AAAI Conference on Artificial Intelligence, 35(1), 125-133. https://doi.org/10.1609/aaai.v35i1.16085

Issue

Section

AAAI Technical Track on Application Domains