[1]
D. Mukherjee and S. Kalyanakrishnan, “Howard’s Policy Iteration is Subexponential for Deterministic Markov Decision Problems with Rewards of Fixed Bit-size and Arbitrary Discount Factor”, ICAPS, vol. 35, no. 1, pp. 84-92, Sep. 2025.