Patent attributes
Disclosed herein are a system, method, and apparatus for generating labels for k-means clusters. The method includes accessing a plurality of data records from a database repository and storing the plurality of data records, along with a cluster number for each data record, in at least one of primary or secondary memory associated with at least one computer processor performing the method. All data records having the same cluster number form a cluster, and each data record has been assigned a cluster number out of a total of K clusters. The method includes, for each of a plurality of classification features, performing cluster-based analysis for a first cluster with respect to that single feature to generate a single-feature overlap score. The method further includes sorting, grouping, and generating a naming label for the first cluster based on a predetermined number of features having the lowest overlap scores.
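The labeling flow described above can be illustrated with a minimal Python sketch. The abstract does not define how the overlap score is computed, so the formula used here is an assumption made purely for illustration: the fraction of records outside the first cluster whose value for a feature falls within that cluster's observed minimum-to-maximum range. The function names `overlap_score` and `label_cluster`, the parameter `top_n` (standing in for the predetermined number of features), the label format, and the sample records are all hypothetical.

```python
def overlap_score(records, cluster_numbers, cluster, feature):
    """Assumed single-feature overlap score (not taken from the disclosure):
    the fraction of records outside `cluster` whose value for `feature`
    lies within the [min, max] range observed inside `cluster`."""
    inside = [r[feature] for r, c in zip(records, cluster_numbers) if c == cluster]
    outside = [r[feature] for r, c in zip(records, cluster_numbers) if c != cluster]
    if not inside or not outside:
        return 0.0
    lo, hi = min(inside), max(inside)
    return sum(lo <= v <= hi for v in outside) / len(outside)


def label_cluster(records, cluster_numbers, cluster, features, top_n=2):
    """Score every feature for one cluster, sort the features by ascending
    overlap score, and join the `top_n` lowest-scoring names into a label."""
    scores = {f: overlap_score(records, cluster_numbers, cluster, f) for f in features}
    ranked = sorted(features, key=lambda f: scores[f])
    return " / ".join(ranked[:top_n]), scores


# Hypothetical data records, each already assigned a cluster number (K = 2).
records = [
    {"age": 22, "income": 30, "visits": 5},
    {"age": 25, "income": 80, "visits": 6},
    {"age": 61, "income": 35, "visits": 5},
    {"age": 67, "income": 75, "visits": 7},
]
cluster_numbers = [0, 0, 1, 1]

label, scores = label_cluster(
    records, cluster_numbers, cluster=0, features=["age", "income", "visits"]
)
print(label)
print(scores)
```

In this sketch, a feature with a low overlap score is one whose values in the first cluster are rarely shared by records in other clusters, which is why the lowest-scoring features are used to name the cluster. Any other overlap measure could be substituted in `overlap_score` without changing the sorting and labeling steps.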