Advanced Black-Box Tuning of Large Language Models with Limited API Calls

Zhikang Xie; Weilin Wan; Peizhu Gong; Weizhong Zhang; Cheng Jin

doi:10.1609/aaai.v40i40.40702

Authors

Zhikang Xie College of Computer Science and Artificial Intelligence, Fudan University
Weilin Wan College of Computer Science and Artificial Intelligence, Fudan University
Peizhu Gong College of Computer Science and Artificial Intelligence, Fudan University
Weizhong Zhang School of Data Science, Fudan University
Cheng Jin College of Computer Science and Artificial Intelligence, Fudan University Shanghai Key Laboratory of Intelligent Information Processing

DOI:

https://doi.org/10.1609/aaai.v40i40.40702

Abstract

Black-box tuning is an emerging paradigm for adapting large language models (LLMs) to better achieve desired behaviors, particularly when direct access to model parameters is unavailable. Current strategies, however, often present a dilemma of suboptimal extremes: either separately train a small proxy model and then use it to shift the predictions of the foundation model, offering notable efficiency but often yielding limited improvement; or making API calls in each tuning iteration to the foundation model, which entails prohibitive computational costs. In this paper, we argue that a more reasonable way for black-box tuning is to train the proxy model with limited API calls. The underlying intuition is based on two key observations: first, the training samples may exhibit correlations and redundancies, suggesting that the foundation model’s predictions can be estimated from previous calls; second, foundation models frequently demonstrate low accuracy on downstream tasks. Therefore, we propose a novel advanced black-box tuning method for LLMs with limited API calls. Our core strategy involves training a Gaussian Process (GP) surrogate model with "LogitMap Pairs" derived from querying the foundation model on a minimal but highly informative training subset. This surrogate can approximate the outputs of the foundation model to guide the training of the proxy model, thereby effectively reducing the need for direct queries to the foundation model. Extensive experiments verify that our approach elevates pre-trained language model accuracy from 55.92% to 86.85%, reducing the frequency of API queries to merely 1.38%. This significantly outperforms offline approaches that operate entirely without API access. Notably, our method also achieves comparable or superior accuracy to query-intensive approaches, while significantly reducing API costs. This offers a robust and high-efficiency paradigm for language model adaptation.

Advanced Black-Box Tuning of Large Language Models with Limited API Calls

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information