Patent attributes
The embodiments described herein enable the efficient storage and searching of large amounts of data to retrieve desired data with high accuracy and fast response time. The embodiments include for example parsing an input document containing input data into multiple sub-documents based on the structure of the input document and generating for each sub-document, a unique ID, pathway, and metadata. The embodiments also include storage of the ID and pathway for each sub-document in an index database and storage of the input document itself in a separate database. When a query for information from the input document is received, the ID and pathway of the sub document containing the queried information are identified and used to locate the queried information from the input document.