Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches

Authors

  • Yu Liu Amazon.com Inc
  • Runzhe Wan Amazon.com Inc
  • James McQueen Amazon.com Inc
  • Doug Hains Amazon.com Inc
  • Jinxiang Gu Amazon.com Inc
  • Rui Song Amazon.com Inc

DOI:

https://doi.org/10.1609/aaai.v38i12.29313

Keywords:

ML: Applications, ML: Auto ML and Hyperparameter Tuning, ML: Clustering, RU: Decision/Utility Theory, RU: Probabilistic Inference

Abstract

The selection of the assumed effect size (AES) critically determines the duration of an experiment, and hence its accuracy and efficiency. Traditionally, experimenters determine AES based on domain knowledge. However, this method becomes impractical for online experimentation services managing numerous experiments, and a more automated approach is hence of great demand. We initiate the study of data-driven AES selection in for online experimentation services by introducing two solutions. The first employs a three-layer Gaussian Mixture Model considering the heteroskedasticity across experiments, and it seeks to estimate the true expected effect size among positive experiments. The second method, grounded in utility theory, aims to determine the optimal effect size by striking a balance between the experiment's cost and the precision of decision-making. Through comparisons with baseline methods using both simulated and real data, we showcase the superior performance of the proposed approaches.

Published

2024-03-24

How to Cite

Liu, Y., Wan, R., McQueen, J., Hains, D., Gu, J., & Song, R. (2024). Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches. Proceedings of the AAAI Conference on Artificial Intelligence, 38(12), 14044–14051. https://doi.org/10.1609/aaai.v38i12.29313

Issue

Section

AAAI Technical Track on Machine Learning III