Patent attributes
A method including obtaining image data and attribute information of a first item in an item catalog. The method also can include generating candidate variant items from the item catalog for the first item using a combination of (a) a k-nearest neighbors approach to search for first candidate variant items based on text embeddings for the attribute information of the first item, and (b) an elastic search approach to search for second candidate variant items based on image embeddings for the image data of the first item. The method additionally can include performing respective classifications based on respective pairs comprising the first item and each of the candidate variant items to filter the candidate variant items. The method further can include determining a respective distance between the first item and each of the candidate variant items, as filtered. The method additionally can include determining one or more items in the candidate variant items, as filtered, to include in a variant group for the first item, based on a decision function using a predetermined threshold and the respective distance for the each of the candidate variant items, as filtered. Other embodiments are described.