(1) Liao, J.; Zhang, T.; Feng, X.; Zhang, Y.; Wang, H.; Wen, B.; Wang, Z.; Shi, R. RLMR: Reinforcement Learning With Mixed Rewards for Creative Writing. AAAI 2026, 40, 31970-31978.