Patent attributes
A device includes an enterprise data indexing engine (EDIE) configured to determine a first set of similarity scores between a first set of sentences from a first document and a plurality of classification descriptions, and to identify one or more classification descriptions whose similarity score in the first set exceeds a predetermined threshold value. The EDIE is further configured to determine a second set of similarity scores between a second set of sentences from a second document and the plurality of classification descriptions, and to identify one or more classification descriptions whose similarity score in the second set exceeds the predetermined threshold value. The EDIE is further configured to populate a data structure that identifies the tokens within the first set of tokens and the second set of tokens and the number of times each token appears.
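
The flow described above can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract names neither the similarity measure nor the source of the tokens, so the sketch assumes a bag-of-words cosine similarity and assumes each token set consists of the words of the classification descriptions matched for that document. All function names, sample sentences, descriptions, and the threshold of 0.5 are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Bag-of-words cosine similarity (assumed; the abstract names no measure)."""
    ca, cb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(ca[t] * cb[t] for t in ca.keys() & cb.keys())
    norm_a = math.sqrt(sum(v * v for v in ca.values()))
    norm_b = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def matching_descriptions(sentences, descriptions, threshold):
    """Return descriptions whose best score against any sentence exceeds the threshold."""
    matched = []
    for desc in descriptions:
        best = max((cosine_similarity(s, desc) for s in sentences), default=0.0)
        if best > threshold:
            matched.append(desc)
    return matched


def token_counts(*token_sets):
    """Populate a token -> occurrence-count structure across all token sets."""
    counts = Counter()
    for tokens in token_sets:
        counts.update(tokens)
    return counts


# Hypothetical inputs, for illustration only.
descriptions = ["loan agreement terms", "loan payment records"]
doc1 = ["This loan agreement sets the terms of repayment."]
doc2 = ["Payment records for each loan are stored monthly."]
THRESHOLD = 0.5  # hypothetical predetermined threshold value

matched1 = matching_descriptions(doc1, descriptions, THRESHOLD)
matched2 = matching_descriptions(doc2, descriptions, THRESHOLD)

# Assumption: a document's token set is the words of its matched descriptions.
tokens1 = [t for d in matched1 for t in tokenize(d)]
tokens2 = [t for d in matched2 for t in tokenize(d)]
index = token_counts(tokens1, tokens2)
```

With these sample inputs, each document matches exactly one description, and the token `loan` is counted twice in `index` because it appears in both token sets. A `Counter` is used here only because it maps each token directly to its number of occurrences; the abstract does not specify the form of the data structure.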