Patent attributes
Aspects provide multilevel design characterization of a web page via identifying different individual graphic element (text characters, images or graphical control elements) displayed within a web page layout, and determining linear groupings thereof (horizontal rows or vertical columns) as a function of differences in their positioning relative to each other. Aspects further identify clusters of the linear groupings and individual graphic elements as a function of clustering indicia (layout pattern indicia, gap level indicia or cluster group indicia), identify repetitive groupings of the clusters as unique list region collections, and determine a tree structure for the unique list region collections that identifies unique list region collections having more dominant element type, size, alignment, style or class name attribute values within the web page layout as root nodes, and others having less dominant element attribute values as child nodes relative to the root nodes.