TY - JOUR AU - Ding, Li AU - Wang, Yongwei AU - Yuan, Kaiwen AU - Jiang, Minyang AU - Wang, Ping AU - Huang, Hua AU - Wang, Z. Jane PY - 2021/05/18 Y2 - 2024/03/28 TI - Towards Universal Physical Attacks on Single Object Tracking JF - Proceedings of the AAAI Conference on Artificial Intelligence JA - AAAI VL - 35 IS - 2 SE - AAAI Technical Track on Computer Vision I DO - 10.1609/aaai.v35i2.16211 UR - https://ojs.aaai.org/index.php/AAAI/article/view/16211 SP - 1236-1245 AB - Recent studies show that small perturbations in video frames could misguide single object trackers. However, such attacks have been mainly designed for digital-domain videos (i.e., perturbation on full images), which makes them practically infeasible to evaluate the adversarial vulnerability of trackers in real-world scenarios. Here we made the first step towards physically feasible adversarial attacks against visual tracking in real scenes with a universal patch to camouflage single object trackers. Fundamentally different from physical object detection, the essence of single object tracking lies in the feature matching between the search image and templates, and we therefore specially design the maximum textural discrepancy (MTD), a resolution-invariant and target location-independent feature de-matching loss. The MTD distills global textural information of the template and search images at hierarchical feature scales prior to performing feature attacks. Moreover, we evaluate two shape attacks, the regression dilation and shrinking, to generate stronger and more controllable attacks. Further, we employ a set of transformations to simulate diverse visual tracking scenes in the wild. Experimental results show the effectiveness of the physically feasible attacks on SiamMask and SiamRPN++ visual trackers both in digital and physical scenes. ER -