Oxytrees: Model Trees for Bipartite Learning

Authors

  • Pedro Ilídio KU Leuven, Campus KULAK, Dept. of Public Health and Primary Care, Etienne Sabbelaan 53, Kortrijk, 8500, Belgium Itec, imec research group at KU Leuven, Etienne Sabbelaan 51, Kortrijk, 8500, Belgium
  • Felipe Kenji Nakano KU Leuven, Campus KULAK, Dept. of Public Health and Primary Care, Etienne Sabbelaan 53, Kortrijk, 8500, Belgium Itec, imec research group at KU Leuven, Etienne Sabbelaan 51, Kortrijk, 8500, Belgium
  • Alireza Gharahighehi KU Leuven, Campus KULAK, Dept. of Public Health and Primary Care, Etienne Sabbelaan 53, Kortrijk, 8500, Belgium Itec, imec research group at KU Leuven, Etienne Sabbelaan 51, Kortrijk, 8500, Belgium
  • Robbe D'hondt KU Leuven, Campus KULAK, Dept. of Public Health and Primary Care, Etienne Sabbelaan 53, Kortrijk, 8500, Belgium Itec, imec research group at KU Leuven, Etienne Sabbelaan 51, Kortrijk, 8500, Belgium
  • Ricardo Cerri Universidade de São Paulo, Instituto de Ciências Matemáticas e de Computação, Av. Trab. São Carlense, 13566-590, São Carlos, São Paulo, Brazil
  • Celine Vens KU Leuven, Campus KULAK, Dept. of Public Health and Primary Care, Etienne Sabbelaan 53, Kortrijk, 8500, Belgium Itec, imec research group at KU Leuven, Etienne Sabbelaan 51, Kortrijk, 8500, Belgium

DOI:

https://doi.org/10.1609/aaai.v40i26.39367

Abstract

Bipartite learning is a machine learning task that aims to predict interactions between pairs of instances. It has been applied to various domains, including drug-target interactions, RNA-disease associations, and regulatory network inference. Despite being widely investigated, current methods still present drawbacks, as they are often designed for a specific application and thus do not generalize to other problems or present scalability issues. To address these challenges, we propose Oxytrees: proxy-based biclustering model trees. Oxytrees compress the interaction matrix into row- and column-wise proxy matrices, significantly reducing training time without compromising predictive performance. We also propose a new leaf-assignment algorithm that significantly reduces the time taken for prediction. Finally, Oxytrees employ linear models using the Kronecker product kernel in their leaves, resulting in shallower trees and thus even faster training. Using 15 datasets, we compared the predictive performance of ensembles of Oxytrees with that of the current state-of-the-art. We achieved up to 30-fold improvement in training times compared to state-of-the-art biclustering forests, while demonstrating competitive or superior performance in most evaluation settings, particularly in the inductive setting. Finally, we provide an intuitive Python API to access all datasets, methods and evaluation measures used in this work, thus enabling reproducible research in this field.

Downloads

Published

2026-03-14

How to Cite

Ilídio, P., Nakano, F. K., Gharahighehi, A., D’hondt, R., Cerri, R., & Vens, C. (2026). Oxytrees: Model Trees for Bipartite Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(26), 22119–22127. https://doi.org/10.1609/aaai.v40i26.39367

Issue

Section

AAAI Technical Track on Machine Learning III