Patent attributes
Disclosed are systems and methods for data mining a plurality of records to identify one or more patterns. A list of frequent items is generated using the records of a certain subpopulation in a dataset of the records. By scanning through the dataset, a prefix tree is generated based on the list of frequent items. Each node in the prefix tree includes an accumulator which maintains separate counts of records from the subpopulation matching the respective node and of records from the plurality of records matching the respective node. One or more population-normalized frequent patterns associated with the plurality of records are extracted based on a traversal of the prefix tree.