Agents on a LEASH: A Case Study in Micro-Managing Web Agent Behavior

Authors

  • Matthew Michelson InferLink Corporation
  • Adrien Bibal InferLink Corporation
  • Steven Minton InferLink Corporation

DOI:

https://doi.org/10.1609/aaaiss.v8i1.42538

Abstract

AI web agents that perform tasks on behalf of users will transform business through massive productivity gains via automation of specific tasks using a web browser. However, current commercial agents and agentic frameworks offer few guarantees that erroneous behavior will be limited. In this paper, we present LEASH, a framework that borrows from planning and focuses on defining micro-tasks for an agent that consists of both sensing and actions. By pre-sensing before an action and post-sensing after, the agent can drastically increase the likelihood of accurate behavior and mitigate risk. We present a real use-case where a LEASH agent performs autonomous pre-screening tasks of articles submitted to the Journal of AI Research (JAIR). The LEASH agent performs the tasks accurately, at scale, and without harm. We compare this to a commercially available agent which may perform well, at times, but which also exhibited grossly uncontrolled behavior.

Downloads

Published

2026-05-18

How to Cite

Michelson, M., Bibal, A., & Minton, S. (2026). Agents on a LEASH: A Case Study in Micro-Managing Web Agent Behavior. Proceedings of the AAAI Symposium Series, 8(1), 179–187. https://doi.org/10.1609/aaaiss.v8i1.42538