Relative Advantage Debiasing for Watch-Time Prediction in Short-Video Recommendation

Authors

  • Emily Liu ByteDance Inc.
  • Kuan Han ByteDance Inc.
  • Minfeng Zhan ByteDance Inc.
  • Bocheng Zhao ByteDance Inc.
  • Guanyu Mu ByteDance Inc.
  • Yang Song ByteDance Inc.

DOI:

https://doi.org/10.1609/aaai.v40i18.38555

Abstract

Watch time is widely used as a proxy for user satisfaction in video recommendation platforms. However, raw watch times are influenced by confounding factors such as video duration, popularity, and individual user behaviors, potentially distorting preference signals and resulting in biased recommendation models. We propose a novel relative advantage debiasing framework that corrects watch time by comparing it to empirically derived reference distributions conditioned on user and item groups. This approach yields a quantile-based preference signal and introduces a two-stage architecture that explicitly separates distribution estimation from preference learning. Additionally, we present distributional embeddings to efficiently parameterize watch-time quantiles without requiring online sampling or storage of historical data. Both offline and online experiments demonstrate significant improvements in recommendation accuracy and robustness compared to existing baseline methods.

Downloads

Published

2026-03-14

How to Cite

Liu, E., Han, K., Zhan, M., Zhao, B., Mu, G., & Song, Y. (2026). Relative Advantage Debiasing for Watch-Time Prediction in Short-Video Recommendation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(18), 15296–15305. https://doi.org/10.1609/aaai.v40i18.38555

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management II