This disclosure describes, in part, techniques for collecting image data representing item identifiers, such as barcodes. For instance, system(s) may receive image data representing images, where the images depict at least a portion of an identifier located on an item. The system(s) may then identify a first portion of the image data representing an image that is associated with a low confidence level. Next, the system(s) may identify a second portion of the image data representing additional images that are associated with high confidence levels. Using results for the second portion of the image data, the system(s) may determine a ground truth result for the first portion of the image data. The system(s) may then store, in one or more databases, data representing the ground truth result in association with the first portion of the image data.
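The flow described above can be illustrated with a minimal sketch. The disclosure does not specify how confidence levels are thresholded or how the high-confidence results are combined, so the following assumes a fixed threshold of 0.9 and a simple majority vote; the names `Scan`, `split_by_confidence`, `ground_truth_from`, and `label_low_confidence` are hypothetical, and an in-memory dictionary stands in for the one or more databases.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Optional


@dataclass
class Scan:
    """One image of an item identifier plus its decode result (hypothetical type)."""
    image_id: str
    decoded: Optional[str]  # decoded identifier, or None if decoding failed
    confidence: float       # confidence level for the decode, in [0, 1]


def split_by_confidence(scans, threshold=0.9):
    """Split scans into low- and high-confidence portions.

    The 0.9 threshold is an assumption made for illustration only.
    """
    low = [s for s in scans if s.confidence < threshold]
    high = [s for s in scans if s.confidence >= threshold]
    return low, high


def ground_truth_from(high):
    """Pick the most common decoded value among high-confidence scans.

    Majority voting is an assumed strategy, not one stated in the source.
    Returns None when no high-confidence scan produced a decoded value.
    """
    votes = Counter(s.decoded for s in high if s.decoded is not None)
    if not votes:
        return None
    return votes.most_common(1)[0][0]


def label_low_confidence(scans, threshold=0.9):
    """Map each low-confidence image id to the ground truth result.

    The returned dict stands in for storing the association in a database.
    """
    low, high = split_by_confidence(scans, threshold)
    truth = ground_truth_from(high)
    if truth is None:
        return {}
    return {s.image_id: truth for s in low}


# Example: one blurry image and two clear images of the same item.
scans = [
    Scan("img-1", "0123", 0.35),
    Scan("img-2", "012345678905", 0.97),
    Scan("img-3", "012345678905", 0.95),
]
labels = label_low_confidence(scans)
```

Here `labels` associates the low-confidence image `img-1` with the identifier decoded from the two high-confidence images. Pairs of this kind could then serve as labeled examples, though how the stored data is used downstream is outside what this passage states.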