Patent 10061773 was granted by the United States Patent and Trademark Office and assigned to CA in August 2018.
A computing device is configured to parse a selected semi-structured or unstructured digital document. Once a document is selected, an appropriate parser is chosen based on the document's content type. The document is parsed and the extracted data is output to a metadata file. Any nested documents included in the selected document are also parsed using an appropriate parser, with their data being output to the same metadata file. Once parsing is complete, the metadata file can be stored and analyzed by a user or administrator using tools that are used to model structured data files.
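
The flow described above (choose a parser by content type, parse the document, recurse into nested documents, and collect everything in one metadata file) can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation: the dictionary-based document model, the `PARSERS` registry, the two toy parsers, and the JSON output format are all assumptions made for the example.

```python
import json

# Hypothetical document model (an assumption for this sketch): a dict with
# "name", "content_type", "body", and an optional "nested" list of documents.


def parse_text(doc):
    """Toy parser for plain text: reports a word count."""
    return {"word_count": len(doc["body"].split())}


def parse_csv(doc):
    """Toy parser for CSV text: reports row and column counts."""
    rows = [line.split(",") for line in doc["body"].splitlines() if line]
    return {"row_count": len(rows), "column_count": len(rows[0]) if rows else 0}


# Hypothetical registry mapping a content type to its parser.
PARSERS = {"text/plain": parse_text, "text/csv": parse_csv}


def parse_document(doc, records=None):
    """Parse a document and every nested document into a flat list of records."""
    if records is None:
        records = []
    parser = PARSERS.get(doc["content_type"])
    # Record unsupported content types instead of failing the whole run.
    fields = parser(doc) if parser else {"error": "no parser registered"}
    records.append(
        {"name": doc["name"], "content_type": doc["content_type"], "fields": fields}
    )
    # Nested documents get their own parser, chosen by their own content type.
    for child in doc.get("nested", []):
        parse_document(child, records)
    return records


def write_metadata(records, path):
    """Write all collected records to a single JSON metadata file."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(records, fh, indent=2)
```

Calling `parse_document` on a text document that embeds a CSV attachment would return two records, one per document, and `write_metadata` would store them together. A flat list of uniform records is used here because it loads easily into tools built for structured data, which is the stated purpose of the metadata file.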