ScriptSmith: A Unified LLM Framework for Enhancing IT Operations via Automated Bash Script Generation, Assessment, and Refinement

Authors

  • Pooja Aggarwal International Business Machines
  • Oishik Chatterjee International Business Machines
  • Ting Dai International Business Machines
  • Suranjana Samanta International Business Machines
  • Prateeti Mohapatra International Business Machines
  • Debanjana Kar International Business Machines
  • Ruchi Mahindru International Business Machines
  • Steve Barbier International Business Machines
  • Eugen Postea International Business Machines
  • Brad Blancett International Business Machines
  • Arthur de Magalhaes International Business Machines

DOI:

https://doi.org/10.1609/aaai.v39i28.35147

Abstract

In the rapidly evolving landscape of site reliability engineering (SRE), the demand for efficient and effective solutions to manage and resolve issues in site and cloud applications is paramount. This paper presents an innovative approach to action automation using large language models (LLMs) for script generation, assessment, and refinement. By leveraging the capabilities of LLMs, we aim to significantly reduce the human effort involved in writing and debugging scripts, thereby enhancing the productivity of SRE teams. Our experiments focus on Bash scripts, a commonly used tool in SRE, and involve the CodeSift dataset of 100 tasks and the InterCode dataset of 153 tasks. The results show that LLMs can automatically assess and refine scripts efficiently, reducing the need for script validation in an execution environment. Results demonstrate that the framework shows an overall improvement of 7-10% in script generation.

Downloads

Published

2025-04-11

How to Cite

Aggarwal, P., Chatterjee, O., Dai, T., Samanta, S., Mohapatra, P., Kar, D., … de Magalhaes, A. (2025). ScriptSmith: A Unified LLM Framework for Enhancing IT Operations via Automated Bash Script Generation, Assessment, and Refinement. Proceedings of the AAAI Conference on Artificial Intelligence, 39(28), 28829–28835. https://doi.org/10.1609/aaai.v39i28.35147