(1) Zhang, L.; Yang, T.; Jin, R.; Zhou, Z.-H. Online Bandit Learning for a Special Class of Non-Convex Losses. AAAI 2015, 29.