Xu, Xiao, Chenfei Wu, Shachar Rosenman, Vasudev Lal, Wanxiang Che, and Nan Duan. 2023. “BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning”. Proceedings of the AAAI Conference on Artificial Intelligence 37 (9):10637-47. https://doi.org/10.1609/aaai.v37i9.26263.