(1)
Lobel, S.; Gottesman, O.; Allen, C.; Bagaria, A.; Konidaris, G. Optimistic Initialization for Exploration in Continuous Control. AAAI 2022, 36, 7612-7619.