Wang, W., Jiang, Y., Sun, G., Dong, C., Jun, Z., Mengjie, L., … Chen, B. (2026). OmniBench: A Comprehensive Benchmark Integrating Real-World, Time-sensitive, and Multi-Hop Questions with a Multi-Dimensional Hybrid Evaluation Framework. Proceedings of the AAAI Conference on Artificial Intelligence, 40(40), 33657–33665. https://doi.org/10.1609/aaai.v40i40.40655