Is a
Patent attributes
Patent Jurisdiction
Patent Number
Date of Patent
June 22, 2021
Patent Application Number
16026723
Date Filed
July 3, 2018
Patent Citations
Patent Primary Examiner
Patent abstract
Systems, methods, and computer program products relating to clustering unstructured data. A set of unstructured documents is tokenized to produce a plurality of tokens. A frequency at which terms appear in the plurality of tokens is analyzed, to generate a vocabulary of terms. A vocabulary indices matrix is generated based on the generated vocabulary of terms. The matrix relates to the set of unstructured documents. A plurality of rows in the vocabulary indices matrix are matched to generate a plurality of clusters for the set of unstructured documents.
Timeline
No Timeline data yet.
Further Resources
No Further Resources data yet.