Reinforcement Learning-based Product Delivery Frequency Control

Authors

  • Yang Liu Facebook, Menlo Park, CA
  • Zhengxing Chen Facebook, Menlo Park, CA
  • Kittipat Virochsiri Facebook, Menlo Park, CA
  • Juan Wang Facebook, Menlo Park, CA
  • Jiahao Wu Facebook, Menlo Park, CA
  • Feng Liang Facebook, Menlo Park, CA

DOI:

https://doi.org/10.1609/aaai.v35i17.17803

Keywords:

Machine Learning, Reinforcement Learning, Frequency Capping, Personalization, Notifications

Abstract

Frequency control is an important problem in modern recommender systems. It dictates the delivery frequency of recommendations to maintain product quality and efficiency. For example, the frequency of delivering promotional notifications impacts daily metrics as well as the infrastructure resource consumption (e.g. CPU and memory usage). There remain open questions on what objective we should optimize to represent business values in the long term best, and how we should balance between daily metrics and resource consumption in a dynamically fluctuating environment. We propose a personalized methodology for the frequency control problem, which combines long-term value optimization using reinforcement learning (RL) with a robust volume control technique we termed "Effective Factor". We demonstrate statistically significant improvement in daily metrics and resource efficiency by our method in several notification applications at a scale of billions of users. To our best knowledge, our study represents the first deep RL application on the frequency control problem at such an industrial scale.

Downloads

Published

2021-05-18

How to Cite

Liu, Y., Chen, Z., Virochsiri, K., Wang, J., Wu, J., & Liang, F. (2021). Reinforcement Learning-based Product Delivery Frequency Control. Proceedings of the AAAI Conference on Artificial Intelligence, 35(17), 15355-15361. https://doi.org/10.1609/aaai.v35i17.17803

Issue

Section

IAAI Technical Track on Emerging Applications of AI