Constrained Molecule Generation Modelled Using the Grammar Constraint
DOI:
https://doi.org/10.1609/aaai.v40i17.38446Abstract
Drug discovery is a very time-consuming and costly endeavour due to its huge design space and to the lengthy and failure-fraught process of bringing a product to market. Automating the generation of candidate molecules exhibiting some of the desired properties can help. Among the standard formats to encode molecules, SMILES is a widespread string representation. We propose a constraint programming model showcasing the grammar constraint to express the design space of organic molecules using the SMILES notation. We show how some common physicochemical properties --- such as molecular weight and lipophilicity --- and structural features can be expressed as constraints in the model. We also contribute a weighted counting algorithm for the grammar constraint, allowing us to use a belief propagation heuristic to guide the generation. Our experiments indicate that such a heuristic is key to driving the search towards desired molecules.Downloads
Published
2026-03-14
How to Cite
Saikali, D., & Pesant, G. (2026). Constrained Molecule Generation Modelled Using the Grammar Constraint. Proceedings of the AAAI Conference on Artificial Intelligence, 40(17), 14313–14321. https://doi.org/10.1609/aaai.v40i17.38446
Issue
Section
AAAI Technical Track on Constraint Satisfaction and Optimization