Learning to Surface Deep Web Content

Zhaohui Wu; Lu Jiang; Qinghua Zheng; Jun Liu

doi:10.1609/aaai.v24i1.7779

Learning to Surface Deep Web Content

Authors

Zhaohui Wu Xi'an Jiaotong University
Lu Jiang Xi'an Jiaotong University
Qinghua Zheng Xi'an Jiaotong University
Jun Liu Xi'an Jiaotong University

DOI:

https://doi.org/10.1609/aaai.v24i1.7779

Keywords:

hidden web, deep web crawling, reinforcement learning

Abstract

We propose a novel deep web crawling framework based on reinforcement learning. The crawler is regarded as an agent and deep web database as the environment. The agent perceives its current state and submits a selected action (query) to the environment according to Q-value. Based on the framework we develop an adaptive crawling method. Experimental results show that it outperforms the state of art methods in crawling capability and breaks through the assumption of full-text search implied by existing methods.

Downloads

Published

2010-07-05

How to Cite

Wu, Z., Jiang, L., Zheng, Q., & Liu, J. (2010). Learning to Surface Deep Web Content. Proceedings of the AAAI Conference on Artificial Intelligence, 24(1), 1967–1968. https://doi.org/10.1609/aaai.v24i1.7779

Download Citation

Issue

Vol. 24 No. 1 (2010): Twenty-Fourth AAAI Conference on Artificial Intelligence

Section

Student Abstracts

Learning to Surface Deep Web Content

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information