RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting

Authors

  • Lei Shu Google Research
  • Liangchen Luo Google Research
  • Jayakumar Hoskere Google Research
  • Yun Zhu Google Research
  • Yinxiao Liu Google Research
  • Simon Tong Google Research
  • Jindong Chen Google Research
  • Lei Meng Google Research

DOI:

https://doi.org/10.1609/aaai.v38i17.29863

Keywords:

NLP: (Large) Language Models, NLP: Applications

Abstract

Large Language Models (LLMs) have demonstrated impressive capabilities in creative tasks such as storytelling and email generation. However, because LLMs are primarily trained on final text results rather than on intermediate revisions, they may struggle with text rewriting tasks. Moreover, most studies of rewriting focus on a single transformation type within the boundaries of individual sentences. In this work, we develop new strategies for instruction tuning and reinforcement learning to better align LLMs for cross-sentence rewriting tasks with diverse wording and structures expressed through natural language, including 1) generating rewriting-instruction data from Wiki edits and public corpora through instruction generation and chain-of-thought prompting, and 2) collecting comparison data for reward-model training through a new ranking function. To facilitate this research, we introduce OpenRewriteEval, a novel benchmark that covers a wide variety of rewriting types expressed through natural language instructions. Our results show significant improvements over a variety of baselines.
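The abstract describes two data-construction steps: deriving natural-language rewrite instructions from source/target edit pairs (e.g., Wiki edits) via chain-of-thought prompting, and ranking candidate rewrites to build comparison pairs for reward-model training. The paper's actual prompts and ranking function are not reproduced on this page; the sketch below is a hypothetical illustration of the general pattern only, and all names in it (COT_PROMPT, generate_instruction, rank-based comparison_pairs, the llm callable, the margin heuristic) are invented for exposition.

    from dataclasses import dataclass
    from typing import Callable, List, Tuple

    # Hypothetical LLM interface; in practice this would wrap a real model API.
    LLM = Callable[[str], str]

    # Illustrative chain-of-thought template, not the paper's actual prompt:
    # the model first reasons about what changed, then emits one instruction.
    COT_PROMPT = """\
    Below is a text before and after an edit.
    Before: {source}
    After: {target}
    First, reason step by step about what changed (wording, structure, tone).
    Then state a single natural-language instruction that would produce this
    edit. End your answer with: Instruction: <instruction>"""

    def generate_instruction(llm: LLM, source: str, target: str) -> str:
        """Derive a rewrite instruction from a (source, target) edit pair
        via chain-of-thought prompting (illustrative sketch)."""
        response = llm(COT_PROMPT.format(source=source, target=target))
        # Keep only the final instruction, discarding the reasoning steps.
        return response.rsplit("Instruction:", 1)[-1].strip()

    @dataclass
    class Candidate:
        text: str
        score: float  # quality under some automatic metric (assumed)

    def comparison_pairs(candidates: List[Candidate],
                         margin: float = 0.1) -> List[Tuple[str, str]]:
        """Rank candidate rewrites and emit (preferred, rejected) pairs for
        reward-model training; the margin filter is an assumed heuristic,
        not the paper's ranking function."""
        ranked = sorted(candidates, key=lambda c: c.score, reverse=True)
        return [
            (better.text, worse.text)
            for i, better in enumerate(ranked)
            for worse in ranked[i + 1:]
            if better.score - worse.score >= margin
        ]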

Published

2024-03-24

How to Cite

Shu, L., Luo, L., Hoskere, J., Zhu, Y., Liu, Y., Tong, S., Chen, J., & Meng, L. (2024). RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 18970-18980. https://doi.org/10.1609/aaai.v38i17.29863

Issue

Vol. 38 No. 17 (2024)

Section

AAAI Technical Track on Natural Language Processing II