Patent attributes
Non-linear classifiers and dimension reduction techniques may be applied to text classification. Non-linear classifiers, such as random forest, Nyström/Fisher, and others, may be used to determine criteria usable to classify text into one of a plurality of categories. Dimension reduction techniques may also be used to reduce the size of the feature space. Machine learning techniques may be used to develop criteria (e.g., trained models) that can be used to classify text automatically. Automatic classification rates may thereby be improved, resulting in fewer text samples being unclassifiable or incorrectly classified. In some embodiments, user-generated content may be classified.
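By way of illustration only, the combination of a reduced feature space and a non-linear classifier may be sketched as follows. The sketch assumes feature hashing as the dimension reduction technique and a small random forest of depth-limited decision trees as the non-linear classifier; neither choice is stated in the text above. All function names, parameter values, sample texts, and category labels are hypothetical.

```python
import random
import zlib
from collections import Counter


def hash_features(text, dim=16):
    """Reduce an open-ended vocabulary to a fixed-length count vector."""
    vec = [0] * dim
    for token in text.lower().split():
        # crc32 is stable across runs, unlike Python's built-in hash().
        vec[zlib.crc32(token.encode("utf-8")) % dim] += 1
    return vec


def gini(labels):
    """Gini impurity of a list of category labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def build_tree(rows, labels, rng, depth=0, max_depth=4):
    """Grow one tree; a leaf is a label (assumed not to be a tuple)."""
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or depth == max_depth:
        return majority
    dim = len(rows[0])
    best = None
    # Each split considers only a random subset of the features.
    for f in rng.sample(range(dim), max(1, int(dim ** 0.5))):
        for t in set(r[f] for r in rows):
            left = [i for i, r in enumerate(rows) if r[f] <= t]
            right = [i for i, r in enumerate(rows) if r[f] > t]
            if not left or not right:
                continue
            score = (len(left) * gini([labels[i] for i in left])
                     + len(right) * gini([labels[i] for i in right])) / len(rows)
            if best is None or score < best[0]:
                best = (score, f, t, left, right)
    if best is None:
        return majority
    _, f, t, left, right = best
    return (
        f,
        t,
        build_tree([rows[i] for i in left], [labels[i] for i in left],
                   rng, depth + 1, max_depth),
        build_tree([rows[i] for i in right], [labels[i] for i in right],
                   rng, depth + 1, max_depth),
    )


def predict_tree(node, row):
    """Walk from the root to a leaf and return the leaf's label."""
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if row[f] <= t else right
    return node


def train_forest(texts, labels, n_trees=25, dim=16, seed=0):
    """Train trees on bootstrap samples of the hashed training texts."""
    rng = random.Random(seed)
    rows = [hash_features(t, dim) for t in texts]
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]
        forest.append(build_tree([rows[i] for i in idx],
                                 [labels[i] for i in idx], rng))
    return forest


def classify(forest, text, dim=16):
    """Assign the text to the category that receives the most tree votes."""
    row = hash_features(text, dim)
    votes = Counter(predict_tree(tree, row) for tree in forest)
    return votes.most_common(1)[0][0]


# Hypothetical user-generated text samples and categories.
texts = [
    "my order arrived late and damaged",
    "please refund my last order",
    "the app crashes when I log in",
    "login page shows an error after update",
]
labels = ["orders", "orders", "technical", "technical"]

forest = train_forest(texts, labels)
print(classify(forest, "refund for a damaged order"))
```

In this sketch the trained forest plays the role of the criteria (the trained model), and the `dim` parameter bounds the size of the feature space regardless of vocabulary size. A production system would more likely rely on an established machine learning library than on a hand-written forest.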