Patent attributes
Disclosed systems and methods categorize text regions of an electronic document into document object types based on a combination of semantic information and appearance information from the electronic document. A page segmentation application executing on a computing device provides a textual feature representation and a visual feature representation to a neural network. The application identifies a correspondence between a location of a set of pixels in the electronic document and a location of a particular document object type in an output page segmentation. The application further outputs a classification of the set of pixels as being the particular document object type based on the identified correspondence.
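The flow described above can be illustrated with a minimal sketch. It is not the disclosed implementation: the neural network is replaced here by a stand-in (a per-pixel linear layer followed by an argmax), and the names `OBJECT_TYPES`, `segment_page`, and `classify_pixels`, the label set, and the array shapes are all assumptions introduced for illustration. The sketch shows only how a textual and a visual feature representation might be combined into an output page segmentation, and how a set of pixels could then be classified by looking up the same locations in that segmentation.

```python
import numpy as np

# Hypothetical label set; the source does not enumerate the document object types.
OBJECT_TYPES = ["background", "paragraph", "heading", "table", "figure"]


def segment_page(textual, visual, weights):
    """Stand-in for the neural network (assumption: per-pixel linear layer + argmax).

    textual: (H, W, Dt) textual feature representation
    visual:  (H, W, Dv) visual feature representation
    weights: (Dt + Dv, num_types) illustrative parameters
    Returns an (H, W) array of object-type indices, i.e. a page segmentation.
    """
    features = np.concatenate([textual, visual], axis=-1)  # (H, W, Dt + Dv)
    scores = features @ weights                            # (H, W, num_types)
    return scores.argmax(axis=-1)


def classify_pixels(segmentation, rows, cols):
    """Classify a set of pixels by the object type found at the same
    locations in the segmentation (majority vote; ties go to the lowest index)."""
    labels = segmentation[rows, cols]
    counts = np.bincount(labels, minlength=len(OBJECT_TYPES))
    return OBJECT_TYPES[int(counts.argmax())]


# Toy 4x4 page: two textual channels, three visual channels.
textual = np.zeros((4, 4, 2))
visual = np.zeros((4, 4, 3))
visual[0:2, 0:2, 0] = 1.0   # combined feature index 2 is active in the top-left block
weights = np.eye(5)         # feature index i votes for object type i

segmentation = segment_page(textual, visual, weights)
result = classify_pixels(segmentation, rows=[0, 0, 1], cols=[0, 1, 1])
print(result)
```

In this toy setup the three queried pixels all fall inside the top-left block, so the lookup returns "heading". A real system would replace the linear stand-in with a trained network and would derive the feature representations from the document itself.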