LIU, Le; WANG, Yuhao; SHEN, Bohan; ZENG, Wei; ZHANG, Shizhou; XU, Di; WANG, Peng. Do Large Language Models Reason About Uncertainty Like Humans? A Benchmark on Hurricane Forecast Visualization Comprehension. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 21, p. 17571–17579, 2026. DOI: 10.1609/aaai.v40i21.38812. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/38812. Accessed: 26 May 2026.