Patent attributes
Systems and methods are disclosed for refining the accuracy of network searches by supplementing existing keywords and key phrases in an e-commerce catalog or other database with aggregated and analyzed additional, external data. The internet or another network can be crawled for identifiers which point to entries in the catalog or other database, and, subject to third-party use restrictions, data and metadata can be extracted to enrich the existing keywords and key phrases. The extracted external content may be processed by machine learning techniques in order to find similar entries in the original catalog or database. Categorizing and indexing the entries further improves search recall, including clustering via processing word embeddings.