Rescind: Countering Image Misconduct in Biomedical Publications with Vision-Language and State-Space Modeling

Authors

  • Soumyaroop Nandi Information Sciences Institute, University of Southern California, Marina Del Rey, CA, USA Thomas Lord Department of Computer Science, University of Southern California, Los Angeles, CA, USA
  • Prem Natarajan Information Sciences Institute, University of Southern California, Marina Del Rey, CA, USA Thomas Lord Department of Computer Science, University of Southern California, Los Angeles, CA, USA CapitalOne, USA

DOI:

https://doi.org/10.1609/aaai.v40i46.41253

Abstract

Scientific image manipulation in biomedical publications poses a growing threat to research integrity and reproducibility. Unlike natural image forensics, biomedical forgery detection is uniquely challenging due to domain-specific artifacts, complex textures, and unstructured figure layouts. We present the first vision-language guided framework for both generating and detecting biomedical image forgeries. By combining diffusion-based synthesis with vision-language prompting, our method enables realistic and semantically controlled manipulations—including duplication, splicing, and region removal—across diverse biomedical modalities. We introduce Rescind, a large-scale benchmark featuring fine-grained annotations and modality-specific splits, and propose Integscan, a structured state-space modeling framework that integrates attention-enhanced visual encoding with prompt-conditioned semantic alignment for precise forgery localization. To ensure semantic fidelity, we incorporate a VLM-based verification loop that filters generated forgeries based on consistency with intended prompts. Extensive experiments on Rescind and existing benchmarks demonstrate that Integscan achieves state-of-the-art performance in both detection and localization, establishing a strong foundation for automated scientific integrity analysis.

Published

2026-03-14

How to Cite

Nandi, S., & Natarajan, P. (2026). Rescind: Countering Image Misconduct in Biomedical Publications with Vision-Language and State-Space Modeling. Proceedings of the AAAI Conference on Artificial Intelligence, 40(46), 39061–39069. https://doi.org/10.1609/aaai.v40i46.41253