Patent attributes
Mechanisms are provided for ingesting natural language textual content. Ingestion of natural language textual content is initiated, and an embedded structured data portion within the natural language textual content is identified. A signature of the structured data portion is generated, comprising one or more metadata elements describing the configuration or content of the structured data portion. References to the structured data portion are identified in natural language text portions of the natural language textual content and are evaluated based on the signature. An ingestion prioritization plan for ingesting a set of facts associated with a set of elements of the structured data portion is generated based on results of the evaluation. The ingestion prioritization plan is applied to generate the set of facts and to store the set of facts in an ingested representation of the natural language textual content.
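
As a rough illustration only, the flow described above might be sketched in Python as follows. The pipe-delimited table format, the choice of column headers and row count as the signature, the use of mention counting as the evaluation, and every function and class name here are assumptions made for this sketch; none of them is taken from the patent itself.

```python
import re
from dataclasses import dataclass


@dataclass
class Signature:
    """Hypothetical signature: metadata describing the table's configuration."""
    columns: list[str]
    row_count: int


def split_document(text):
    """Separate pipe-delimited table rows (assumed format) from prose lines."""
    table_lines, prose_lines = [], []
    for line in text.splitlines():
        (table_lines if "|" in line else prose_lines).append(line.strip())
    rows = [[cell.strip() for cell in line.split("|")] for line in table_lines]
    return rows, " ".join(line for line in prose_lines if line)


def build_signature(rows):
    """Build the signature from the header row and the number of data rows."""
    header, *body = rows
    return Signature(columns=header, row_count=len(body))


def score_references(prose, signature):
    """Count the prose sentences that mention each column header."""
    sentences = re.split(r"(?<=[.!?])\s+", prose)
    return {
        col: sum(col.lower() in sentence.lower() for sentence in sentences)
        for col in signature.columns
    }


def prioritization_plan(scores):
    """Order columns by mention count, most referenced first (ties keep table order)."""
    return sorted(scores, key=lambda col: -scores[col])


def ingest(rows, plan):
    """Emit (row key, column, value) facts in the order given by the plan."""
    header, *body = rows
    key_column = header[0]  # assumption: the first column identifies the row
    facts = []
    for col in plan:
        if col == key_column:
            continue
        idx = header.index(col)
        for row in body:
            facts.append((row[0], col, row[idx]))
    return facts


DOC = """The study compared two regimens. Dosage was the main variable of interest.
Patients on a higher dosage reported more side effects.
Drug | Dosage | Cost
A | 10 mg | $5
B | 20 mg | $9
Cost was similar across sites."""

rows, prose = split_document(DOC)
signature = build_signature(rows)
plan = prioritization_plan(score_references(prose, signature))
for fact in ingest(rows, plan):
    print(fact)
```

In this toy document the prose mentions dosage more often than cost, so the plan places the Dosage column first and its facts are generated before the Cost facts. A real system would presumably use richer signatures and reference evaluation than simple substring matching.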