Learning Robust and Privacy-Preserving Representations via Information Theory

Binghui Zhang; Sayedeh Leila Noorbakhsh; Yun Dong; Yuan Hong; Binghui Wang

doi:10.1609/aaai.v39i21.34392

Authors

Binghui Zhang, Illinois Institute of Technology
Sayedeh Leila Noorbakhsh, Illinois Institute of Technology
Yun Dong, Milwaukee School of Engineering
Yuan Hong, University of Connecticut
Binghui Wang, Illinois Institute of Technology

Abstract

Machine learning models are vulnerable to both security attacks (e.g., adversarial examples) and privacy attacks (e.g., private attribute inference). We take the first step toward mitigating both kinds of attack while maintaining task utility. In particular, we propose an information-theoretic framework that pursues these goals through the lens of representation learning, i.e., learning representations that are robust to both adversarial examples and attribute inference adversaries. We also derive novel theoretical results under our framework, including the inherent trade-off between adversarial robustness/utility and attribute privacy, and a guarantee on attribute privacy leakage against attribute inference adversaries.
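
The abstract does not state the framework's objectives, so the following is only a toy illustration of how mutual information can quantify attribute leakage and task utility for a representation. It is a minimal sketch, not the authors' method: the helper `mutual_information`, the variables `u` (private attribute), `y` (task label), and the two hand-built representations are all hypothetical, and adversarial robustness is not modeled here.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Empirical mutual information I(X;Y), in bits, from paired discrete samples."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x,y) * log2( p(x,y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


# Hypothetical toy data: u is a private binary attribute, y is the task label.
# They are constructed to be statistically independent of each other.
u = [0, 0, 1, 1, 0, 0, 1, 1]
y = [0, 1, 0, 1, 0, 1, 0, 1]

# Two hand-built "representations" (assumptions for illustration only):
r_leaky = list(zip(y, u))  # keeps both the label and the private attribute
r_private = list(y)        # keeps only the label

leak_leaky = mutual_information(r_leaky, u)        # attribute leakage of r_leaky
leak_private = mutual_information(r_private, u)    # attribute leakage of r_private
utility_private = mutual_information(r_private, y) # task information retained

print(leak_leaky, leak_private, utility_private)
```

In this toy setting the representation that discards the private attribute leaks no information about it while keeping all information about the label. When the label and the private attribute are correlated, both quantities cannot be optimized at once, which is the kind of trade-off the abstract refers to.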

Published

2025-04-11

How to Cite

Zhang, B., Noorbakhsh, S. L., Dong, Y., Hong, Y., & Wang, B. (2025). Learning Robust and Privacy-Preserving Representations via Information Theory. Proceedings of the AAAI Conference on Artificial Intelligence, 39(21), 22363–22371. https://doi.org/10.1609/aaai.v39i21.34392

Issue

Vol. 39 No. 21: AAAI-25 Technical Tracks 21

Section

AAAI Technical Track on Machine Learning VII