A Novel Architecture for Deep Web Crawler

A Novel Architecture for Deep Web Crawler

Dilip Kumar Sharma, A. K. Sharma
Copyright: © 2018 |Pages: 25
ISBN13: 9781522531630|ISBN10: 1522531637|EISBN13: 9781522531647
DOI: 10.4018/978-1-5225-3163-0.ch015
Cite Chapter Cite Chapter

MLA

Sharma, Dilip Kumar, and A. K. Sharma. "A Novel Architecture for Deep Web Crawler." The Dark Web: Breakthroughs in Research and Practice, edited by Information Resources Management Association, IGI Global, 2018, pp. 334-358. https://doi.org/10.4018/978-1-5225-3163-0.ch015

APA

Sharma, D. K. & Sharma, A. K. (2018). A Novel Architecture for Deep Web Crawler. In I. Management Association (Ed.), The Dark Web: Breakthroughs in Research and Practice (pp. 334-358). IGI Global. https://doi.org/10.4018/978-1-5225-3163-0.ch015

Chicago

Sharma, Dilip Kumar, and A. K. Sharma. "A Novel Architecture for Deep Web Crawler." In The Dark Web: Breakthroughs in Research and Practice, edited by Information Resources Management Association, 334-358. Hershey, PA: IGI Global, 2018. https://doi.org/10.4018/978-1-5225-3163-0.ch015

Export Reference

Mendeley
Favorite

Abstract

A traditional crawler picks up a URL, retrieves the corresponding page and extracts various links, adding them to the queue. A deep Web crawler, after adding links to the queue, checks for forms. If forms are present, it processes them and retrieves the required information. Various techniques have been proposed for crawling deep Web information, but much remains undiscovered. In this paper, the authors analyze and compare important deep Web information crawling techniques to find their relative limitations and advantages. To minimize limitations of existing deep Web crawlers, a novel architecture is proposed based on QIIIEP specifications (Sharma & Sharma, 2009). The proposed architecture is cost effective and has features of privatized search and general search for deep Web data hidden behind html forms.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.