Patent attributes
A computing device receives a request to search a corpus of documents for an entity, wherein the request includes a non-name identifier of the entity. The computing device identifies entries of text within the corpus of documents that reference the non-name identifier. The computing device applies natural language processing (NLP) to content associated with the identified entries within the corpus of documents, wherein the NLP identifies candidate entities associated with the non-name identifier. The computing device selects an entity from the candidate entities based, at least in part, on distances between the candidate entities and references to the non-name identifier in the identified entries. The computing device returns the selected entity to a submitter of the request.
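The selection step can be illustrated with a minimal sketch. It is not the claimed implementation, and several parts are assumptions made only for illustration: runs of capitalized tokens stand in for the NLP step that identifies candidate entities, distance is measured in tokens, and each candidate mention contributes a proximity score of 1 / (1 + distance), summed over the entries. The function names (`tokenize`, `find_candidates`, `select_entity`) and the sample corpus are hypothetical.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Split text into word-like tokens (letters, digits, hyphens)."""
    return re.findall(r"\w[\w-]*", text)


def find_candidates(tokens):
    """Assumed stand-in for NLP entity recognition: each run of
    capitalized tokens is treated as one candidate entity.
    Returns (name, start_index, end_index) tuples."""
    candidates = []
    i = 0
    while i < len(tokens):
        if tokens[i][0].isupper():
            j = i
            while j < len(tokens) and tokens[j][0].isupper():
                j += 1
            candidates.append((" ".join(tokens[i:j]), i, j - 1))
            i = j
        else:
            i += 1
    return candidates


def select_entity(corpus, identifier):
    """Return the candidate entity mentioned closest, and most often,
    to references to `identifier`, or None if nothing qualifies."""
    scores = defaultdict(float)
    for entry in corpus:
        tokens = tokenize(entry)
        id_positions = [k for k, tok in enumerate(tokens) if tok == identifier]
        if not id_positions:
            continue  # this entry does not reference the identifier
        for name, start, end in find_candidates(tokens):
            if name == identifier:
                continue  # the identifier is not a candidate for itself
            # Token distance from this mention to the nearest reference.
            distance = min(
                min(abs(p - start), abs(p - end)) for p in id_positions
            )
            # Assumed scoring: nearer mentions contribute more.
            scores[name] += 1.0 / (1.0 + distance)
    if not scores:
        return None
    return max(scores, key=scores.get)


# Hypothetical corpus; "4471" plays the role of the non-name identifier.
docs = [
    "invoice 4471 was issued to Acme Corp after review by Jane Doe",
    "per Acme Corp records invoice 4471 remains unpaid",
    "Jane Doe filed a note about shipping delays",
]

print(select_entity(docs, "4471"))  # prints "Acme Corp"
```

In this sample, "Acme Corp" is mentioned near the identifier in two entries while "Jane Doe" is mentioned farther away in only one, so the former is selected. A real system would replace the capitalization heuristic with a proper named-entity recognizer and could weight distances differently.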