Is a
Patent attributes
Current Assignee
Patent Jurisdiction
Patent Number
Date of Patent
May 10, 2011
Patent Application Number
11270014
Date Filed
November 9, 2005
Patent Primary Examiner
Patent abstract
A computer-implemented method of generating a dynamic corpus includes generating web threads, based upon corresponding sets of words dequeued from a word queue, to obtain web thread resulting URLs. The web thread resulting URLs are enqueued in a URL queue. Multiple text extraction threads are generated, based upon documents downloaded using URLs dequeued from the URL queue, to obtain text files. New words are randomly obtained from the text files, and the randomly obtained words from the text files are enqueued in the word queue. This process is iteratively performed, resulting in a dynamic corpus.
Timeline
No Timeline data yet.
Further Resources
No Further Resources data yet.