Patent 10223585 was granted and assigned to Adobe Inc. on March, 2019 by the United States Patent and Trademark Office.
Disclosed systems and methods generate page segmented documents from unstructured vector graphics documents. The page segmentation application executing on a computing device receives as input an unstructured vector graphics document comprising drawing commands. The application generates an element proposal for each of many areas on a page of the input document tentatively identified as being page elements. Each of the element proposals may be generated at least in part based on the drawing commands. The page segmentation application classifies each of the element proposals into one of a plurality of defined type of categories of page elements at least in part based on the drawing commands. The page segmentation application may further refine at least one of the element proposals and select a final element proposal for each element within the unstructured vector document. One or more of the page segmentation steps may be performed using a neural network.