Ripple Shapley: Data Influence Attribution in One Federated Training Run

Authors

  • Dewen Zeng University of South China
  • Wenlong Tian University of South China
  • Haozhao Wang Huazhong University of Science and Technology
  • Jianfeng Lu Wuhan University of Science and Technology
  • Weijun Xiao Virginia Commonwealth University
  • Zhiyong Xu Suffolk University

DOI:

https://doi.org/10.1609/aaai.v40i33.40034

Abstract

Contribution evaluation is essential for incentivizing high-quality data sharing in federated learning (FL), yet existing Shapley-value-based methods are prohibitively expensive and overlook temporal influence propagation. In this paper, we propose Ripple Shapley, a novel attribution framework that enables accurate, real-time data valuation within a single federated training run. Our method decomposes each sample’s impact into an instantaneous drop term and a recursive ripple term, the latter capturing downstream influence via a Jacobian chain over global updates. To scale computation, we introduce a low-rank approximation of the Jacobian product and construct a shared subspace for efficient ripple accumulation. Extensive experiments on CIFAR-10 and MNIST show that Ripple Shapley achieves up to 62× speedup over existing Shapley-based FL methods while maintaining high attribution fidelity, significantly improving efficiency, robustness, and fairness in federated environments. We further demonstrate its effectiveness in dynamic federated learning scenarios and its potential for real-time data pricing.

Published

2026-03-14

How to Cite

Zeng, D., Tian, W., Wang, H., Lu, J., Xiao, W., & Xu, Z. (2026). Ripple Shapley: Data Influence Attribution in One Federated Training Run. Proceedings of the AAAI Conference on Artificial Intelligence, 40(33), 28085–28093. https://doi.org/10.1609/aaai.v40i33.40034

Issue

Section

AAAI Technical Track on Machine Learning X