Patent 10332210 was granted by the United States Patent and Trademark Office in June 2019 and assigned to Nationwide Mutual Insurance Company.
Improved computer modeling techniques, and uses thereof, are described herein. A set of unstructured textual data is received. Certain textual data is removed from the set of unstructured textual data to form an initial vocabulary set of textual data. One or more bigrams are added to the initial vocabulary set of textual data to form a final vocabulary set of textual data. The final vocabulary set of textual data is divided into a plurality of subsets of textual data based on type. A model is trained using each of the plurality of subsets of textual data to form a plurality of trained models, each corresponding to one of the types.
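
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say which text is removed, how bigrams are chosen, what a "type" is, or what kind of model is trained. The sketch assumes that removal means dropping a small hypothetical stopword list, that bigrams are adjacent token pairs, that each record carries a type label, and that a term-frequency counter can stand in for the model. The names STOPWORDS, tokenize, add_bigrams, and train_models, and the sample records, are all invented for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical stopword list; the abstract does not say which text is removed.
STOPWORDS = {"the", "a", "an", "was", "to", "of", "and", "in", "on"}


def tokenize(text):
    """Lowercase, split on whitespace, strip punctuation, and drop stopwords."""
    tokens = (word.strip(".,;:!?").lower() for word in text.split())
    return [t for t in tokens if t and t not in STOPWORDS]


def add_bigrams(tokens):
    """Return the tokens plus each adjacent pair joined with an underscore."""
    bigrams = [f"{a}_{b}" for a, b in zip(tokens, tokens[1:])]
    return tokens + bigrams


def train_models(records):
    """Train one stand-in model (a term-frequency Counter) per record type."""
    # Divide the final vocabulary into subsets keyed by type.
    subsets = defaultdict(list)
    for record_type, text in records:
        subsets[record_type].append(add_bigrams(tokenize(text)))

    # "Training" here is only counting terms; a real model would replace this.
    models = {}
    for record_type, docs in subsets.items():
        counts = Counter()
        for doc in docs:
            counts.update(doc)
        models[record_type] = counts
    return models


# Invented sample data: (type label, unstructured text) pairs.
records = [
    ("auto", "The vehicle was rear ended on the highway."),
    ("auto", "Rear bumper damaged in parking lot."),
    ("property", "Water damage to the kitchen ceiling."),
]
models = train_models(records)
```

In this sketch bigrams are built after stopword removal, so a pair can span a removed word; whether the patent forms bigrams before or after filtering is not stated in the abstract.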