Sim-to-Real: An Unsupervised Noise Layer for Screen-Camera Watermarking Robustness

Authors

  • Yufeng Wu College of Cyber Science and Technology, Hunan University, Changsha 410082, China
  • Xin Liao College of Cyber Science and Technology, Hunan University, Changsha 410082, China
  • Baowei Wang the Engineering Research Center of Digital Forensics, Ministry of Education, the School of Computer Science, Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, Nanjing University of Information Science and Technology, Nanjing 210044, China Jiangsu Yuchi Blockchain Technology Research Institute, Nanjing 210000, China
  • Han Fang School of Computing, National University of Singapore, Singapore
  • Xiaoshuai Wu College of Cyber Science and Technology, Hunan University, Changsha 410082, China
  • Mingyue Chen College of Cyber Science and Technology, Hunan University, Changsha 410082, China
  • Guiling Wang Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA

DOI:

https://doi.org/10.1609/aaai.v40i2.37103

Abstract

Unauthorized screen capturing and dissemination pose severe security threats such as data leakage and information theft. Several studies propose robust watermarking methods to track the copyright of Screen-Camera (SC) images, facilitating post-hoc certification against infringement. These techniques typically employ heuristic mathematical modeling or supervised neural network fitting as the noise layer, to enhance watermarking robustness against SC. However, both strategies cannot fundamentally achieve an effective approximation of SC noise. Mathematical simulation suffers from biased approximations due to the incomplete decomposition of the noise and the absence of interdependence among the noise components. Supervised networks require paired data to train the noise-fitting model, and it is difficult for the model to learn all the features of the noise. To address the above issues, we propose Simulation-to-Real (S2R). Specifically, an unsupervised noise layer employs unpaired data to learn the discrepancy between the modeled simulated noise distribution and the real-world SC noise distribution, rather than directly learning the mapping from sharp images to real-world images. Learning this transformation from simulation to reality is inherently simpler, as it primarily involves bridging the gap in noise distributions, instead of the complex task of reconstructing fine-grained image details. Extensive experimental results validate the efficacy of the proposed method, demonstrating superior watermark robustness and generalization compared to state-of-the-art methods.

Downloads

Published

2026-03-14

How to Cite

Wu, Y., Liao, X., Wang, B., Fang, H., Wu, X., Chen, M., & Wang, G. (2026). Sim-to-Real: An Unsupervised Noise Layer for Screen-Camera Watermarking Robustness. Proceedings of the AAAI Conference on Artificial Intelligence, 40(2), 1303–1310. https://doi.org/10.1609/aaai.v40i2.37103

Issue

Section

AAAI Technical Track on Application Domains II