Mixture-of-Trees: Learning to Select and Weigh Reasoning Paths for Efficient LLM Inference

Yangbo Wei; Zhen Huang; Shaoqiang Lu; Junhong Qian; Dongge Qin; Ting Jung Lin; WEI W. XING; Chen Wu; Lei He

doi:10.1609/aaai.v40i40.40677

Authors

Yangbo Wei (Shanghai Jiao Tong University; Eastern Institute of Technology, Ningbo)
Zhen Huang (University of Science and Technology of China; Eastern Institute of Technology, Ningbo)
Shaoqiang Lu (Shanghai Jiao Tong University; Eastern Institute of Technology, Ningbo)
Junhong Qian (Southeast University)
Dongge Qin (Southeast University)
Ting Jung Lin (Eastern Institute of Technology, Ningbo)
WEI W. XING (University of Sheffield)
Chen Wu (Eastern Institute of Technology, Ningbo)
Lei He (Eastern Institute of Technology, Ningbo)

Abstract

We introduce Mixture-of-Trees (MoT), a novel framework that integrates sparse expert activation with structured tree-based reasoning for efficient LLM inference. MoT uses a learned gating mechanism to activate only the most relevant expert reasoning trees for each problem, with experts using models of varying capacity according to task complexity. The framework features three key innovations: (1) sparse expert activation through unified gating networks, (2) specialized expert trees that leverage domain-specific expertise while optimizing the quality-efficiency trade-off, and (3) collaborative debate mechanisms for resolving conflicting solutions. MoT also includes a shared baseline tree with early stopping: activated experts perform lightweight validation and terminate early when confidence is high. Experiments across five benchmarks (GSM8K, MATH, AIME 2024, MMLU, HotpotQA) show that MoT improves accuracy by 2-7 percentage points while reducing LLM calls by 37-40% compared to existing multi-path methods.
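
The control flow described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the gate scores are passed in as plain numbers rather than produced by a learned gating network, each expert tree and the shared baseline tree are stand-in callables, and a weighted vote replaces the collaborative debate step. The names `select_experts`, `answer`, `k`, and `stop_conf` are hypothetical.

```python
import math
from typing import Callable, Dict, List, Tuple


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over raw gate scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def select_experts(gate_scores: Dict[str, float], k: int) -> List[Tuple[str, float]]:
    """Sparse activation: keep the top-k experts and renormalize their weights."""
    names = list(gate_scores)
    probs = softmax([gate_scores[n] for n in names])
    top = sorted(zip(names, probs), key=lambda p: p[1], reverse=True)[:k]
    norm = sum(w for _, w in top)
    return [(n, w / norm) for n, w in top]


def answer(
    problem: str,
    gate_scores: Dict[str, float],
    experts: Dict[str, Callable[[str], str]],
    baseline: Callable[[str], Tuple[str, float]],
    k: int = 2,
    stop_conf: float = 0.9,
) -> str:
    """Return the baseline answer when it is confident, else a weighted expert vote."""
    # Assumption: the shared baseline tree returns (answer, confidence in [0, 1]).
    base_answer, base_conf = baseline(problem)
    if base_conf >= stop_conf:
        return base_answer  # early stop: no expert tree is expanded

    # Assumption: a weighted vote stands in for the debate mechanism.
    votes: Dict[str, float] = {}
    for name, weight in select_experts(gate_scores, k):
        candidate = experts[name](problem)
        votes[candidate] = votes.get(candidate, 0.0) + weight
    return max(votes, key=votes.get)
```

With three experts and `k=2`, only the two highest-scoring experts are called, which is where the saving in LLM calls would come from in a setup like this; a confident baseline skips expert calls entirely.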
