(1)
Tan, T.; Xiong, Z.; Dwaracherla, V. R. Parameterized Indexed Value Function for Efficient Exploration in Reinforcement Learning. AAAI 2020, 34, 5948-5955.