Gene Regulatory Network Inference as Relaxed Graph Matching

Deborah Weighill; Marouen Ben Guebila; Camila Lopes-Ramos; Kimberly Glass; John Quackenbush; John Platig; Rebekka Burkholz

doi:10.1609/aaai.v35i11.17230

Authors

Deborah Weighill Harvard T.H. Chan School of Public Health, Boston, MA 02115
Marouen Ben Guebila Harvard T.H. Chan School of Public Health, Boston, MA 02115
Camila Lopes-Ramos Harvard T.H. Chan School of Public Health, Boston, MA 02115
Kimberly Glass Harvard T.H. Chan School of Public Health, Boston, MA 02115 Channing Division of Network Medicine, Brigham and Women’s Hospital Harvard Medical School, Boston, MA 02115
John Quackenbush Harvard T.H. Chan School of Public Health, Boston, MA 02115 Channing Division of Network Medicine, Brigham and Women’s Hospital Harvard Medical School, Boston, MA 02115
John Platig Channing Division of Network Medicine, Brigham and Women’s Hospital Harvard Medical School, Boston, MA 02115
Rebekka Burkholz Harvard T.H. Chan School of Public Health, Boston, MA 02115

DOI:

https://doi.org/10.1609/aaai.v35i11.17230

Keywords:

Graph-based Machine Learning, Bioinformatics

Abstract

Bipartite network inference is a ubiquitous problem across disciplines. One important example in the field molecular biology is gene regulatory network inference. Gene regulatory networks are an instrumental tool aiding in the discovery of the molecular mechanisms driving diverse diseases, including cancer. However, only noisy observations of the projections of these regulatory networks are typically assayed. In an effort to better estimate regulatory networks from their noisy projections, we formulate a non-convex but analytically tractable optimization problem called OTTER. This problem can be interpreted as relaxed graph matching between the two projections of the bipartite network. OTTER's solutions can be derived explicitly and inspire a spectral algorithm, for which we provide network recovery guarantees. We also provide an alternative approach based on gradient descent that is more robust to noise compared to the spectral algorithm. Interestingly, this gradient descent approach resembles the message passing equations of an established gene regulatory network inference method, PANDA. Using three cancer-related data sets, we show that OTTER outperforms state-of-the-art inference methods in predicting transcription factor binding to gene regulatory regions. To encourage new graph matching applications to this problem, we have made all networks and validation data publicly available.

Gene Regulatory Network Inference as Relaxed Graph Matching

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription