[1] C. Xu, Z. He, Z. He, and J. McAuley, “Leashing the Inner Demons: Self-Detoxification for Language Models”, AAAI, vol. 36, no. 10, pp. 11530–11537, Jun. 2022.