Patent attributes
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying objects in images. One of the methods includes obtaining a first training image; down-sampling the first training image to generate a low-resolution first training image; processing the low-resolution first training image using a first neural network to generate a plurality of features of the low-resolution first training image and first scores for the low-resolution first training image; processing the first scores and the features of the low-resolution first training image using an initial patch locator neural network to generate an initial location of an initial patch of the first training image; locally perturbing the initial location to select an adjusted location for the initial patch of the first training image; and updating the current values of the parameters of the initial patch locator neural network to generate updated values using the adjusted location.