[1]
S. Zhang, “Video-Audio Domain Generalization via Confounder Disentanglement”, AAAI, vol. 37, no. 12, pp. 15322-15330, Jun. 2023.