Stress-Testing of Multimodal Models in Medical Image-Based Report Generation

Authors

  • Flávia Carvalhido LIACC - Artificial Intelligence and Computer Science Laboratory, Faculty of Engineering, University of Porto
  • Henrique Lopes Cardoso LIACC - Artificial Intelligence and Computer Science Laboratory, Faculty of Engineering, University of Porto
  • Vítor Cerqueira LIACC - Artificial Intelligence and Computer Science Laboratory, Faculty of Engineering, University of Porto

DOI:

https://doi.org/10.1609/aaai.v39i28.35203

Abstract

Multimodal models, namely vision-language models, present unique possibilities through the seamless integration of different information mediums for data generation. These models mostly act as a black-box, making them lack transparency and explicability. Reliable results require accountable and trustworthy Artificial Intelligence (AI), namely when in use for critical tasks, such as the automatic generation of medical imaging reports for healthcare diagnosis. By exploring stress-testing techniques, multimodal generative models can become more transparent by disclosing their shortcomings, further supporting their responsible usage in the medical field.

Downloads

Published

2025-04-11

How to Cite

Carvalhido, F., Lopes Cardoso, H., & Cerqueira, V. (2025). Stress-Testing of Multimodal Models in Medical Image-Based Report Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 39(28), 29251–29252. https://doi.org/10.1609/aaai.v39i28.35203