Ripple Shapley: Data Influence Attribution in One Federated Training Run
DOI:
https://doi.org/10.1609/aaai.v40i33.40034Abstract
Contribution evaluation is essential for incentivizing high-quality data sharing in federated learning (FL), yet existing Shapley-value-based methods are prohibitively expensive and overlook temporal influence propagation. In this paper, we propose Ripple Shapley, a novel attribution framework that enables accurate, real-time data valuation within a single federated training run. Our method decomposes each sample’s impact into an instantaneous drop term and a recursive ripple term, the latter capturing downstream influence via a Jacobian chain over global updates. To scale computation, we introduce a low-rank approximation of the Jacobian product and construct a shared subspace for efficient ripple accumulation. Extensive experiments on CIFAR-10 and MNIST show that Ripple Shapley achieves up to 62× speedup over existing Shapley-based FL methods while maintaining high attribution fidelity, significantly improving efficiency, robustness, and fairness in federated environments. We further demonstrate its effectiveness in dynamic federated learning scenarios and its potential for real-time data pricing.Downloads
Published
2026-03-14
How to Cite
Zeng, D., Tian, W., Wang, H., Lu, J., Xiao, W., & Xu, Z. (2026). Ripple Shapley: Data Influence Attribution in One Federated Training Run. Proceedings of the AAAI Conference on Artificial Intelligence, 40(33), 28085–28093. https://doi.org/10.1609/aaai.v40i33.40034
Issue
Section
AAAI Technical Track on Machine Learning X