Patent attributes
A data extraction and expansion system receives documents with data to be processed, extracts a set of a specific type of entities from the received documents, expands the set of entities by retrieving additional entities of the specific type from an ontology and other external data sources to improve the match between the received documents. The ontology includes data regarding entities and relationships between entities. The ontology is built by extracting the entity and relationship information from external data sources and can be constantly updated. If the additional entities to expand the set of entities cannot be retrieved from the ontology then a real-time search of the external data sources is executed to retrieve the additional entities from the external data sources.