[1]

T. Tan, Z. Xiong, and V. R. Dwaracherla, “Parameterized Indexed Value Function for Efficient Exploration in Reinforcement Learning”, AAAI, vol. 34, no. 04, pp. 5948-5955, Apr. 2020.