Wen, S., Chen, H., Wang, Y., Pan, Z., Chen, X., Tian, Y., … Huang, S.-J. (2026). MultiMedBench: A Scenario-Aware Benchmark for Evaluating Knowledge Editing in Medical VQA. Proceedings of the AAAI Conference on Artificial Intelligence, 40(40), 33872–33880. https://doi.org/10.1609/aaai.v40i40.40679