(1)
Mujtaba, D.; Hu, B.; Hoogs, A.; Basharat, A. Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping. AAAI 2026, 40, 37738-37746.