Patent attributes
Provided are a method, a computer program product, and a system for classifying a set of data items based on format organizations. A processor may determine at least one format organization of a set of data items. The format organization of a data item indicates a symbol type of at least one continuous symbol in the data item and a number of the at least one continuous symbol. The processor may determine at least one candidate data class for the set of data items from a plurality of predetermined data classes based on the at least one format organization. The processor may classify the set of data items into at least one target data class selected from the at least one candidate data class. In this way, the set of data items can be efficiently classified.
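
The flow described above can be illustrated with a minimal Python sketch. It is not the claimed implementation. It assumes that a format organization is a run-length encoding of symbol types over consecutive characters, that each predetermined data class lists the format organizations it accepts, and that a target class is a candidate covering a minimum share of the items. The names `symbol_type`, `format_organization`, `DATA_CLASSES`, `candidate_classes`, `classify`, and the `min_share` threshold are all hypothetical and do not come from the source.

```python
from collections import Counter
from itertools import groupby


def symbol_type(ch: str) -> str:
    """Map a character to a coarse symbol type (hypothetical taxonomy)."""
    if ch.isdigit():
        return "D"  # digit
    if ch.isalpha():
        return "L"  # letter
    return ch  # keep other symbols literal so "-" and "/" stay distinct


def format_organization(item: str) -> tuple:
    """Encode an item as (symbol type, run length) pairs for consecutive symbols."""
    return tuple(
        (kind, len(list(run))) for kind, run in groupby(item, key=symbol_type)
    )


# Hypothetical predetermined data classes, each listing the formats it accepts.
DATA_CLASSES = {
    "us_zip": {format_organization("12345"), format_organization("12345-6789")},
    "iso_date": {format_organization("2024-01-31")},
    "us_phone": {format_organization("555-123-4567")},
}


def candidate_classes(items, classes=DATA_CLASSES) -> dict:
    """Return {class name: share of items whose format the class accepts}."""
    formats = Counter(format_organization(item) for item in items)
    total = sum(formats.values())
    shares = {}
    for name, accepted in classes.items():
        matched = sum(n for fmt, n in formats.items() if fmt in accepted)
        if matched:  # a class is a candidate if it matches at least one item
            shares[name] = matched / total
    return shares


def classify(items, classes=DATA_CLASSES, min_share=0.8) -> list:
    """Select target classes: candidates covering at least min_share of items."""
    shares = candidate_classes(items, classes)
    targets = [name for name, share in shares.items() if share >= min_share]
    return sorted(targets, key=lambda name: -shares[name])
```

Under these assumptions, `format_organization("AB-1234")` yields `(("L", 2), ("-", 1), ("D", 4))`, and `classify(["90210", "10001-0001", "30301"])` selects `["us_zip"]`. Comparing compact format signatures instead of running every class's full validator on every item is one plausible reading of the efficiency the abstract refers to.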