AdaO2B: Adaptive Online to Batch Conversion for Out-of-Distribution Generalization

Authors

  • Xiao Zhang, Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China
  • Sunhao Dai, Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China
  • Jun Xu, Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China
  • Yong Liu, Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China
  • Zhenhua Dong, Huawei Noah's Ark Lab, Shenzhen, China

DOI:

https://doi.org/10.1609/aaai.v39i21.34418

Abstract

Online to batch conversion constructs a batch learner from the sequence of models generated by an existing online learning algorithm, in order to achieve generalization guarantees under the i.i.d. assumption. However, in real-world streaming applications such as streaming recommender systems, the data stream may be sampled from time-varying distributions rather than remaining i.i.d., which poses a challenge for out-of-distribution (OOD) generalization. Existing approaches employ fixed conversion mechanisms that cannot adapt to novel testing distributions, which limits the testing accuracy of the batch learner. To address these issues, we propose AdaO2B, an adaptive online to batch conversion approach under the bandit setting. AdaO2B is designed to be aware of distribution shifts in the testing data and achieves OOD generalization guarantees. Specifically, AdaO2B dynamically combines the sequence of models learned by a contextual bandit algorithm, determining the combination weights with a context-aware weighting function. This converts the sequence of models into a batch learner that facilitates OOD generalization. Theoretical analysis justifies why and how the learned adaptive batch learner achieves OOD generalization error guarantees. Experimental results demonstrate that AdaO2B significantly outperforms state-of-the-art baselines on both synthetic and real-world recommendation datasets.
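
The contrast between a fixed conversion and a context-aware one can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the form of the weighting function, so the softmax gating below, the linear reward models, and every name (`uniform_weights`, `context_aware_weights`, `gate_params`, `batch_predict`) are assumptions chosen only for illustration.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def dot(u: Vector, v: Vector) -> float:
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax; the outputs are positive and sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def uniform_weights(num_models: int) -> List[float]:
    """Fixed conversion: every online model gets the same weight."""
    return [1.0 / num_models] * num_models


def context_aware_weights(context: Vector, gate_params: Sequence[Vector]) -> List[float]:
    """Hypothetical weighting function: one gating vector per model,
    scored against the test context and normalized with softmax.
    (Assumed form; the abstract does not define the weighting function.)"""
    return softmax([dot(g, context) for g in gate_params])


def batch_predict(context: Vector, models: Sequence[Vector], weights: Sequence[float]) -> float:
    """Batch learner: weighted combination of the linear models' predicted rewards."""
    return sum(w * dot(theta, context) for w, theta in zip(weights, models))


# Two linear reward models standing in for models produced at different
# rounds of a contextual bandit algorithm (illustrative values only).
models = [[1.0, 0.0], [0.0, 1.0]]
gate_params = [[2.0, 0.0], [0.0, 2.0]]  # hypothetical gating parameters
context = [1.0, 0.0]

fixed = batch_predict(context, models, uniform_weights(len(models)))
adaptive = batch_predict(context, models, context_aware_weights(context, gate_params))
print(f"fixed={fixed:.3f} adaptive={adaptive:.3f}")
```

In this toy setup the context-aware weights shift toward the model whose gating vector aligns with the test context, whereas the uniform weights ignore the context entirely. How the weighting function is actually parameterized and trained is described in the paper, not here.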

Published

2025-04-11

How to Cite

Zhang, X., Dai, S., Xu, J., Liu, Y., & Dong, Z. (2025). AdaO2B: Adaptive Online to Batch Conversion for Out-of-Distribution Generalization. Proceedings of the AAAI Conference on Artificial Intelligence, 39(21), 22596-22604. https://doi.org/10.1609/aaai.v39i21.34418

Section

AAAI Technical Track on Machine Learning VII