Is a
Patent attributes
Patent Jurisdiction
Patent Number
Patent Inventor Names
Neeraj Agrawal0
Sreeram Viswanath Balakrishnan0
Sachindra Joshi0
Date of Patent
February 1, 2011
0Patent Application Number
120544820
Date Filed
March 25, 2008
0Patent Primary Examiner
Patent abstract
A method (100) of crawling the Web (620) is disclosed. The method (100) crawls (120) Web pages on the Web starting from a given (110) set of seed Universal Resource Locators (URLs). Crawled Web pages are partitioned (140) into sets of relevant and irrelevant pages. A set of exclusion and/or inclusion patterns are discovered (150) from the sets of relevant and irrelevant pages, and subsequent crawling of the Web is restricted through the set of exclusion and/or inclusion patterns.
Timeline
No Timeline data yet.
Further Resources
No Further Resources data yet.