Lan, M., J. Zhang, F. He, and L. Zhang. “Siamese Network With Interactive Transformer for Video Object Segmentation”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, no. 2, June 2022, pp. 1228-36, doi:10.1609/aaai.v36i2.20009.