Keyphrase Extraction with Sequential Pattern Mining
DOI:
https://doi.org/10.1609/aaai.v31i1.11075Keywords:
keyphrase extraction, sequential pattern mining, entropy, interval calculationAbstract
Existing studies show that extracting a complete keyphrase candidate set is the first and crucial step to extract high quality keyphrases from documents. Based on a common sense that words do not repeatedly appear in an effective keyphrase, we propose a novel algorithm named KCSP for document-specific keyphrase candidate search using sequential pattern mining with gap constraints, which only needs to scan a document once and automatically specifies appropriate gap constraints for words without users’ participation. The experimental results confirm that it helps improve the quality of keyphrase extraction.