[1]
L. Guo, “K-ON: Stacking Knowledge on the Head Layer of Large Language Model”, AAAI, vol. 39, no. 11, pp. 11745–11753, Apr. 2025.