Patent attributes
Improved computer modeling techniques, and uses thereof, are described herein. A set of unstructured textual data is received. Certain textual data is removed from the set of unstructured textual data to form an initial vocabulary set of textual data. One or more bigrams are added to the initial vocabulary set of textual data to form a final vocabulary set of textual data. The final vocabulary set of textual data is divided into a plurality of subsets of textual data based on type. A model is trained using each of the plurality of subsets of textual data to form a plurality of trained models, each corresponding to one of the types.
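The steps in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation, and several details are assumptions made only for illustration: the removed text is taken to be a small hypothetical stop-word list, a bigram is formed from each pair of adjacent remaining tokens, each record is assumed to arrive with a type label, and the "model" is a plain term-frequency counter standing in for whatever model the patent actually trains. The names STOP_WORDS, build_initial_vocabulary, add_bigrams, and train_models_by_type are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical stop-word list; the abstract does not say which text is removed.
STOP_WORDS = {"a", "an", "the", "is", "of", "to", "and"}


def build_initial_vocabulary(text):
    """Lowercase, split on whitespace, strip edge punctuation, drop stop words."""
    tokens = [t.strip(".,;:!?()\"'").lower() for t in text.split()]
    return [t for t in tokens if t and t not in STOP_WORDS]


def add_bigrams(tokens):
    """Return the tokens plus one bigram per pair of adjacent tokens.

    Bigrams are built after stop-word removal, so they may join words
    that were not adjacent in the raw text (an assumption of this sketch).
    """
    bigrams = [f"{a}_{b}" for a, b in zip(tokens, tokens[1:])]
    return tokens + bigrams


def train_models_by_type(records):
    """records: iterable of (type_label, text). Returns {type_label: Counter}."""
    # Form the final vocabulary set: cleaned tokens plus bigrams, per record.
    final = [(label, add_bigrams(build_initial_vocabulary(text)))
             for label, text in records]

    # Divide the final vocabulary set into subsets based on type.
    subsets = defaultdict(list)
    for label, terms in final:
        subsets[label].extend(terms)

    # "Train" one stand-in model (a term-frequency counter) per subset.
    return {label: Counter(terms) for label, terms in subsets.items()}


if __name__ == "__main__":
    sample = [
        ("complaint", "The battery is dead."),
        ("praise", "The battery life is great."),
        ("complaint", "Dead battery again."),
    ]
    for label, model in train_models_by_type(sample).items():
        print(label, model.most_common(3))
```

A real system would presumably replace the counter with a statistical or neural model fitted to each subset; the point of the sketch is only the ordering of the steps: filter, add bigrams, split by type, then train one model per type.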