(1) Stolz, R.; Eichelbeck, M.; Althoff, M. Improving Stochastic Action-Constrained Reinforcement Learning via Truncated Distributions. AAAI 2026, 40, 25617-25626.