Fast and Constrained Absent Keyphrase Generation by Prompt-Based Learning


  • Huanqin Wu Tencent
  • Baijiaxin Ma Peking University
  • Wei Liu Tencent
  • Tao Chen Tencent
  • Dan Nie Tencent



Speech & Natural Language Processing (SNLP)


Generating absent keyphrases, which do not appear in the input document, is challenging in the keyphrase prediction task. Most previous works treat the problem as an autoregressive sequence-to-sequence generation task, which demonstrates promising results for generating grammatically correct and fluent absent keyphrases. However, such an end-to-end process with a complete data-driven manner is unconstrained, which is prone to generate keyphrases inconsistent with the input document. In addition, the existing autoregressive decoding method makes the generation of keyphrases must be done from left to right, leading to slow speed during inference. In this paper, we propose a constrained absent keyphrase generation method in a prompt-based learning fashion. Specifically, the prompt will be created firstly based on the keywords, which are defined as the overlapping words between absent keyphrase and document. Then, a mask-predict decoder is used to complete the absent keyphrase on the constraint of prompt. Experiments on keyphrase generation benchmarks have demonstrated the effectiveness of our approach. In addition, we evaluate the performance of constrained absent keyphrases generation from an information retrieval perspective. The result shows that our approach can generate more consistent keyphrases, which can improve document retrieval performance. What’s more, with a non-autoregressive decoding manner, our model can speed up the absent keyphrase generation by 8.67× compared with the autoregressive method.




How to Cite

Wu, H., Ma, B., Liu, W., Chen, T., & Nie, D. (2022). Fast and Constrained Absent Keyphrase Generation by Prompt-Based Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 36(10), 11495-11503.



AAAI Technical Track on Speech and Natural Language Processing