Return to Article Details Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach Download Download PDF