Return to Article Details An Improved Lower Bound on the Length of Locally-Improving Policy Sequences in MDPs with Large Action Sets Download Download PDF