Patent attributes
A method, a storage medium, and a system for document content reconstruction are provided in a digital content delivery and online education services platform to enable delivery of textbooks and other copyrighted material to multi-platform web browser applications. The method comprises ingesting a document page in an unstructured document format. The method further comprises extracting, from the document page, one or more images together with their associated metadata, and text together with its associated fonts. In addition, the method comprises coalescing the text into paragraphs and creating a structured document page in a markup language format using the extracted images, text, and fonts, rendered with layout fidelity to the original ingested document page.
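
The coalescing and markup-generation steps can be illustrated with a minimal Python sketch. It is not the claimed implementation. The `TextRun` record, the `gap_factor` threshold, and the choice of absolutely positioned HTML are all assumptions introduced here for illustration; the abstract specifies neither a data model nor a particular markup language. Image extraction is omitted.

```python
from dataclasses import dataclass
from html import escape


@dataclass
class TextRun:
    """One extracted text fragment (hypothetical shape, not from the source)."""
    text: str
    x: float     # left edge in points
    y: float     # top edge in points, measured from the top of the page
    font: str    # font family name
    size: float  # font size in points


def coalesce_paragraphs(runs, gap_factor=1.5):
    """Group runs into paragraphs.

    Assumed heuristic: a new paragraph starts when the vertical step from
    the previous run exceeds gap_factor times that run's font size.
    """
    paragraphs = []
    prev = None
    for run in sorted(runs, key=lambda r: (r.y, r.x)):
        if prev is None or run.y - prev.y > gap_factor * prev.size:
            paragraphs.append([run])
        else:
            paragraphs[-1].append(run)
        prev = run
    return paragraphs


def render_page(paragraphs, width, height):
    """Emit HTML that places each paragraph at its original page position."""
    parts = [
        f'<div class="page" '
        f'style="position:relative;width:{width}pt;height:{height}pt">'
    ]
    for para in paragraphs:
        first = para[0]
        text = escape(" ".join(r.text for r in para))
        style = (
            f"position:absolute;left:{first.x}pt;top:{first.y}pt;"
            f"font-family:'{escape(first.font)}';"
            f"font-size:{first.size}pt;margin:0"
        )
        parts.append(f'  <p style="{style}">{text}</p>')
    parts.append("</div>")
    return "\n".join(parts)


# Example input: two lines of one paragraph, then a second paragraph.
runs = [
    TextRun("Hello", 72, 100, "Times", 12),
    TextRun("world.", 72, 114, "Times", 12),
    TextRun("Next paragraph.", 72, 150, "Times", 12),
]
page_html = render_page(coalesce_paragraphs(runs), width=612, height=792)
```

Absolute positioning is used here only because it is the simplest way to show layout fidelity: each paragraph keeps the coordinates and font of its first run. A real system would also need to handle multi-column layouts, mixed fonts within a paragraph, and embedded images.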