Patent attributes
Techniques for identifying similarity between a primary content item and candidate content items are disclosed. The techniques include receiving a similarity request from a client device, the similarity request including contextual data about a primary content item; determining a vector number for the primary content item using the contextual data; identifying candidate content items based on the contextual data; retrieving vector numbers for the identified candidate content items; computing a distance between the vector number of the primary content item and the vector number of each of the candidate content items; ranking the candidate content items based on their computed distance from the primary content item; and selecting at least a subset of the ranked candidate content items as similar content items and communicating the selected subset of the ranked candidate content items to the client device for display on a display of the client device.