Patent 10504145 was granted by the United States Patent and Trademark Office and assigned to GumGum in December 2019.
Systems and methods are provided for generating and using classification models to automatically classify pages or other source content as including text about one or more real-world events. Generating the classification models may include analyzing text content of a large number of different pages from both a reference source and from more dynamic sources, such as from publisher sources via a network. Features for training classifiers may be determined based in part on the top n-grams identified among pages that have been determined to be associated with a given event.
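As a rough illustration of the n-gram step mentioned in the abstract, the sketch below counts the most frequent n-grams across pages already associated with an event and then uses those n-grams as count features for a new page. It is not the patented method: the function names (`tokenize`, `ngrams`, `top_ngrams`, `featurize`), the simple regex tokenizer, the choice of bigrams, and the sample pages are all assumptions made for the example, and the abstract does not specify how features are weighted or which classifier is trained on them.

```python
from collections import Counter
import re


def tokenize(text):
    # Assumed tokenizer: lowercase alphanumeric runs only.
    return re.findall(r"[a-z0-9]+", text.lower())


def ngrams(tokens, n):
    # Contiguous n-grams, joined into space-separated strings.
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def top_ngrams(pages, n=2, k=3):
    # Count n-grams across all pages tied to one event and keep the k most common.
    counts = Counter()
    for page in pages:
        counts.update(ngrams(tokenize(page), n))
    return [gram for gram, _ in counts.most_common(k)]


def featurize(page, vocabulary, n=2):
    # One feature per top n-gram: how often it occurs in the page.
    counts = Counter(ngrams(tokenize(page), n))
    return [counts[gram] for gram in vocabulary]


# Hypothetical pages assumed to be associated with the same event.
event_pages = [
    "The world cup final drew a record crowd.",
    "Fans celebrated after the world cup final.",
]
vocabulary = top_ngrams(event_pages, n=2, k=3)
features = featurize("Tickets for the world cup final sold out.", vocabulary)
```

The resulting `features` list could be passed to any standard classifier as one row of a training or scoring matrix; which model the patent actually uses is not stated in the abstract.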