Paulo, Gonçalo, Thomas Marshall, and Nora Belrose. 2025. “Do Transformer Interpretability Methods Transfer to RNNs?” Proceedings of the AAAI Conference on Artificial Intelligence 39 (26): 27565–72. https://doi.org/10.1609/aaai.v39i26.34969.