1. Mujtaba D, Hu B, Hoogs A, Basharat A. Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 19];40(44):37738-46. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/41109