[1]
Y. Lu and W. Armour, “Beyond the Mean: Fisher-Orthogonal Projection for Natural Gradient Descent in Large Batch Training”, AAAI, vol. 40, no. 29, pp. 24115–24123, Mar. 2026.