1.
Wang W, Jiang Y, Sun G, Dong C, Jun Z, Mengjie L, et al. OmniBench: A Comprehensive Benchmark Integrating Real-World, Time-sensitive, and Multi-Hop Questions with a Multi-Dimensional Hybrid Evaluation Framework. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 14];40(40):33657-65. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/40655