Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches

Yu Liu; Runzhe Wan; James McQueen; Doug Hains; Jinxiang Gu; Rui Song

doi:10.1609/aaai.v38i12.29313

Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches

Authors

Yu Liu Amazon.com Inc
Runzhe Wan Amazon.com Inc
James McQueen Amazon.com Inc
Doug Hains Amazon.com Inc
Jinxiang Gu Amazon.com Inc
Rui Song Amazon.com Inc

DOI:

https://doi.org/10.1609/aaai.v38i12.29313

Keywords:

ML: Applications, ML: Auto ML and Hyperparameter Tuning, ML: Clustering, RU: Decision/Utility Theory, RU: Probabilistic Inference

Abstract

The selection of the assumed effect size (AES) critically determines the duration of an experiment, and hence its accuracy and efficiency. Traditionally, experimenters determine AES based on domain knowledge. However, this method becomes impractical for online experimentation services managing numerous experiments, and a more automated approach is hence of great demand. We initiate the study of data-driven AES selection in for online experimentation services by introducing two solutions. The first employs a three-layer Gaussian Mixture Model considering the heteroskedasticity across experiments, and it seeks to estimate the true expected effect size among positive experiments. The second method, grounded in utility theory, aims to determine the optimal effect size by striking a balance between the experiment's cost and the precision of decision-making. Through comparisons with baseline methods using both simulated and real data, we showcase the superior performance of the proposed approaches.

AAAI-24 / IAAI-24 / EAAI-24 Proceedings Cover

Downloads

Published

2024-03-24

How to Cite

Liu, Y., Wan, R., McQueen, J., Hains, D., Gu, J., & Song, R. (2024). Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches. Proceedings of the AAAI Conference on Artificial Intelligence, 38(12), 14044–14051. https://doi.org/10.1609/aaai.v38i12.29313

Download Citation

Issue

Vol. 38 No. 12: AAAI-24 Technical Tracks 12

Section

AAAI Technical Track on Machine Learning III

Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information