Gaussian Approximation for Two-Timescale Linear Stochastic Approximation

Authors

  • Bogdan Butyrin Higher School of Economics
  • Artemy Rubtsov Higher School of Economics
  • Alexey Naumov Higher School of Economics
  • Vladimir V. Ulyanov Higher School of Economics Lomonosov Moscow State University
  • Sergey Samsonov Higher School of Economics

DOI:

https://doi.org/10.1609/aaai.v40i43.40986

Abstract

In this paper, we establish non-asymptotic bounds for accuracy of normal approximation for linear two-timescale stochastic approximation (TTSA) algorithms driven by martingale difference or Markov noise. Focusing on both the last iterate and Polyak–Ruppert averaging regimes, we derive bounds for normal approximation in terms of the convex distance between probability distributions. Our analysis reveals a non-trivial interaction between the fast and slow timescales: the normal approximation rate for the last iterate improves as the timescale separation increases, while it decreases in the Polyak–Ruppert averaged setting. We also provide the high-order moment bounds for the error of linear TTSA algorithm, which may be of independent interest. Finally, we demonstrate that our theoretical results are directly applicable to reinforcement learning algorithms such as GTD and TDC.

Downloads

Published

2026-03-14

How to Cite

Butyrin, B., Rubtsov, A., Naumov, A., Ulyanov, V. V., & Samsonov, S. (2026). Gaussian Approximation for Two-Timescale Linear Stochastic Approximation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(43), 36627–36635. https://doi.org/10.1609/aaai.v40i43.40986

Issue

Section

AAAI Technical Track on Reasoning under Uncertainty