Patent attributes
A method comprises: acquiring a to-be-compressed topic model, wherein each line of the topic model represents a distribution of a word among respective topics; performing a format conversion on the topic model to obtain a first topic model, wherein each line of the first topic model represents a distribution of a topic among respective words; selecting any two topics from the first topic model to form a topic pair, forming a topic pair set using at least one topic pair, and determining a similarity between the two topics in each topic pair in the topic pair set; merging topic pairs having a similarity greater than a similarity threshold to generate a second topic model; and performing a format conversion on the second topic model to obtain a compressed topic model, so that each line of the compressed topic model represents a distribution of a word among the respective topics.
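The steps recited above can be illustrated with a minimal Python sketch. It is not the claimed implementation, and several details are assumptions the abstract does not state: the topic model is held as a dense word-by-topic count matrix, the similarity measure is cosine similarity, topics linked by any above-threshold pair are grouped transitively, and a group is merged by summing its rows. The names `transpose`, `cosine`, and `compress_topic_model` are hypothetical.

```python
from itertools import combinations
from math import sqrt


def transpose(rows):
    """Swap the row/column roles of a dense matrix given as a list of lists."""
    return [list(col) for col in zip(*rows)]


def cosine(u, v):
    """Cosine similarity; an assumed measure, the source does not name one."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def compress_topic_model(word_topic, threshold):
    """Merge similar topics in a word-by-topic count matrix (hypothetical helper)."""
    # Format conversion: each row now describes one topic over all words.
    topic_word = transpose(word_topic)
    n = len(topic_word)

    # Union-find parents, used to group topics linked by a similar pair.
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Form every topic pair and link the pairs whose similarity exceeds the threshold.
    for i, j in combinations(range(n), 2):
        if cosine(topic_word[i], topic_word[j]) > threshold:
            ri, rj = find(i), find(j)
            if ri != rj:
                parent[max(ri, rj)] = min(ri, rj)

    # Merge each group by summing its rows (an assumed merge rule).
    merged = {}
    for i, row in enumerate(topic_word):
        root = find(i)
        if root in merged:
            merged[root] = [a + b for a, b in zip(merged[root], row)]
        else:
            merged[root] = list(row)

    # Convert back so each row again describes one word over the remaining topics.
    second_topic_model = [merged[r] for r in sorted(merged)]
    return transpose(second_topic_model)
```

For example, with three words and three topics, `compress_topic_model([[10, 9, 0], [5, 6, 0], [0, 0, 7]], 0.9)` merges the first two topics, whose word distributions are nearly parallel, and leaves the third unchanged, so each word row shrinks from three topic columns to two.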