Zhang, H., Cao, X., & Wang, R. (2018). Audio Visual Attribute Discovery for Fine-Grained Object Recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.12295