Patent attributes
Methods and systems for condensing a sequential data set on a computer system. A sequential data set is received by the computer system and analyzed to identify a number of occurrences in the sequential data set of each of a plurality of unique data value pairs. A condensed data set is generated including a data element for each of the unique data value pairs. Each data element in the condensed data set includes an identification of the first data value and the second data value of the unique data value pair as well as a count indicative of the number of occurrences in the sequential data set of the first data value immediately followed by the second data value. A graphical signature is generated including a plurality of vectors each extending between two nodes corresponding to the data values of a different data element.