Reinforcement Learning-based Product Delivery Frequency Control

Yang Liu; Zhengxing Chen; Kittipat Virochsiri; Juan Wang; Jiahao Wu; Feng Liang

doi:10.1609/aaai.v35i17.17803

Authors

Yang Liu Facebook, Menlo Park, CA
Zhengxing Chen Facebook, Menlo Park, CA
Kittipat Virochsiri Facebook, Menlo Park, CA
Juan Wang Facebook, Menlo Park, CA
Jiahao Wu Facebook, Menlo Park, CA
Feng Liang Facebook, Menlo Park, CA

DOI:

https://doi.org/10.1609/aaai.v35i17.17803

Keywords:

Machine Learning, Reinforcement Learning, Frequency Capping, Personalization, Notifications

Abstract

Frequency control is an important problem in modern recommender systems. It dictates the delivery frequency of recommendations to maintain product quality and efficiency. For example, the frequency of delivering promotional notifications impacts daily metrics as well as the infrastructure resource consumption (e.g. CPU and memory usage). There remain open questions on what objective we should optimize to represent business values in the long term best, and how we should balance between daily metrics and resource consumption in a dynamically fluctuating environment. We propose a personalized methodology for the frequency control problem, which combines long-term value optimization using reinforcement learning (RL) with a robust volume control technique we termed "Effective Factor". We demonstrate statistically significant improvement in daily metrics and resource efficiency by our method in several notification applications at a scale of billions of users. To our best knowledge, our study represents the first deep RL application on the frequency control problem at such an industrial scale.

Reinforcement Learning-based Product Delivery Frequency Control

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription