Data Attribution: A Data-Centric Approach for Trustworthy AI Development
DOI:
https://doi.org/10.1609/aaai.v39i27.35114Abstract
Data plays an increasingly crucial role in both the performance and the safety of AI models. Data attribution is an emerging family of techniques aimed at quantifying the impact of individual training data points on a model trained on them, which has found data-centric applications such as instance-based explanation, unsafe training data detection, and copyright compensation. In this talk, I will comprehensively review our work contributing to the applications, methods, and open-source benchmarks of data attribution, and discuss open challenges in this field.Downloads
Published
2025-04-11
How to Cite
Ma, J. (2025). Data Attribution: A Data-Centric Approach for Trustworthy AI Development. Proceedings of the AAAI Conference on Artificial Intelligence, 39(27), 28720–28720. https://doi.org/10.1609/aaai.v39i27.35114
Issue
Section
New Faculty Highlights