Patent attributes
Systems, devices, and techniques are disclosed for generating training data for visual search model training. A catalog may be received that includes catalog entries, each of which may include images of an item and data about the item. Labels may be applied to the images of the items based on the data about the items. The images of the items may be sorted into clusters using cluster analysis on the labels. Each cluster may include labels as categories of the cluster. Additional images may be received based on searching for the categories. Generative adversarial network (GAN) training data sets may be generated from the images of the items, the additional images, and the categories. GANs may be trained with the GAN training data sets. The GANs may generate images that include generated items, and the generated items may be replaced with images of items from the catalog entries to create feature model training images.
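As an illustrative, non-limiting sketch, the labeling, clustering, and training-set assembly steps might be arranged as follows. Everything named here is an assumption made for illustration and is not taken from the disclosure: the `CatalogEntry` type, the keyword `VOCABULARY`, the greedy Jaccard-overlap clustering (standing in for whatever cluster analysis an implementation may use), and the similarity threshold. GAN training and the replacement of generated items with catalog images are omitted.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class CatalogEntry:
    """Hypothetical catalog entry: one item image plus text data about the item."""
    image: str        # path or identifier of the item image
    description: str  # free-text data about the item


# Hypothetical label vocabulary; a real system might derive labels differently.
VOCABULARY = {"sofa", "chair", "table", "lamp", "leather", "wood", "fabric"}


def label_image(entry: CatalogEntry) -> frozenset[str]:
    """Apply labels to an image based on the data about the item."""
    words = entry.description.lower().split()
    return frozenset(w for w in words if w in VOCABULARY)


def jaccard(a: frozenset[str], b: frozenset[str]) -> float:
    """Jaccard similarity between two label sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_by_labels(labelled, threshold: float = 0.3) -> list[dict]:
    """Greedy clustering on labels (an assumed stand-in for cluster analysis).

    Each image joins the first cluster whose accumulated label set overlaps
    its own labels by at least `threshold`; otherwise it starts a new cluster.
    """
    clusters: list[dict] = []
    for image, labels in labelled:
        for cluster in clusters:
            if jaccard(labels, frozenset(cluster["labels"])) >= threshold:
                cluster["images"].append(image)
                cluster["labels"].update(labels)
                break
        else:
            clusters.append({"images": [image], "labels": Counter(labels)})
    return clusters


def build_gan_training_set(cluster: dict, additional_images: list[str]) -> dict:
    """Combine catalog images, additional images, and the cluster's categories."""
    return {
        "categories": sorted(cluster["labels"]),
        "images": cluster["images"] + additional_images,
    }
```

In such a sketch, the `categories` of each cluster would serve as the search terms used to retrieve the additional images, and each resulting training set would be passed to a separate GAN training step.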