(1) Liu, L.; Wang, Y.; Shen, B.; Zeng, W.; Zhang, S.; Xu, D.; Wang, P. Do Large Language Models Reason About Uncertainty Like Humans? A Benchmark on Hurricane Forecast Visualization Comprehension. AAAI 2026, 40, 17571-17579.