[1] D. Mujtaba, B. Hu, A. Hoogs, and A. Basharat, “Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping”, AAAI, vol. 40, no. 44, pp. 37738–37746, Mar. 2026.