Reproduce, Replicate, Reevaluate. The Long but Safe Way to Extend Machine Learning Methods

Luisa Werner; Nabil Layaïda; Pierre Genevès; Jérôme Euzenat; Damien Graux

doi:10.1609/aaai.v38i14.29515

Authors

Luisa Werner Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP, LIG F-38000 Grenoble, France
Nabil Layaïda Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP, LIG F-38000 Grenoble, France
Pierre Genevès Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP, LIG F-38000 Grenoble, France
Jérôme Euzenat Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP, LIG F-38000 Grenoble, France
Damien Graux ADAPT SFI Research Centre, Trinity College Dublin, Ireland.

DOI:

https://doi.org/10.1609/aaai.v38i14.29515

Keywords:

ML: Transparent, Interpretable, Explainable ML, ML: Applications, ML: Ethics, Bias, and Fairness, ML: Graph-based Machine Learning, ML: Neuro-Symbolic Learning, ML: Representation Learning

Abstract

Reproducibility is a desirable property of scientific research. On the one hand, it increases confidence in results. On the other hand, reproducible results can be extended on a solid basis. In rapidly developing fields such as machine learning, the latter is particularly important to ensure the reliability of research. In this paper, we present a systematic approach to reproducing (using the available implementation), replicating (using an alternative implementation) and reevaluating (using different datasets) state-of-the-art experiments. This approach enables the early detection and correction of deficiencies and thus the development of more robust and transparent machine learning methods. We detail the independent reproduction, replication, and reevaluation of the initially published experiments with a method that we want to extend. For each step, we identify issues and draw lessons learned. We further discuss solutions that have proven effective in overcoming the encountered problems. This work can serve as a guide for further reproducibility studies and generally improve reproducibility in machine learning.

Reproduce, Replicate, Reevaluate. The Long but Safe Way to Extend Machine Learning Methods

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription