Abstract
INTELLIGENT CROWLING TECHNIQUES FOR LOCATING THE SITES
Dr. Sulochana Sonkamble and Dr. Balwant Sonkamble*
ABSTRACT
There are various techniques for crawling the web sites for searching the desired information. The paper titled ?Intelligent Crawling Techniques For Locating the Sites? is focusing on the intelligent technique for searching the desired information which is available on the internet. The variety of information in the form of web pages on the internet is growing tremendously day to day. In this case searching relevant information on the Internet is a crucial and difficult task. This information is hidden behind query forms that interface to unexplored databases containing high quality structured data. Traditional search engines cannot access and index this hidden part of the information. A new term ?Web Retrieving? arises to find this hidden information and become the challenge to the researcher. We have presented a two-stage framework, namely SmartCrawler, for effectively harvesting deep web interfaces. The first stage is called as Site Locating, and a second stage is called as In-Site Exploring. At the first stage of Site locating, the center pages are searched with the help of search engines which in turn avoid visiting a large number of pages. To achieve more precise results for a focused crawl, SmartCrawler ranks websites to prioritize highly relevant ones for a given topic. In the second stage, adaptive link-ranking achieves fast in-site searching by exploring most relevant links. The in-site exploring stage uses adaptive link-ranking to search within a site; and we design a link tree for eliminating bias toward certain directories of a website for wider coverage of web directories.
[Full Text Article] [Download Certificate]