Zhang, Z., Zhang, H., Zhao, L., Chen, T., Arik, S. Ö., & Pfister, T. (2022). Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding. Proceedings of the AAAI Conference on Artificial Intelligence, 36(3), 3417-3425. https://doi.org/10.1609/aaai.v36i3.20252