Patent attributes
A Convolutional Neural Network (CNN) includes an initial set of convolutional layers and max pooling units, in which any input is convoluted with the learned image filters and the output is a stack of the different filter responses. Max pooling produces a scaled version of the output. The process can be repeated several times, resulting in a stack of space invariant-scaled images. Since the operation is space invariant, the computations of these layers not need to be recomputed if interested just in certain regions of the image. A Region Of Interest (ROI) Pooling layer is used to select regions to be processed by the set of fully connected layers, which uses the response of the multiple convolutional layers of the network to determine the regions where the objects (of different scales) could be located. This object proposal method is implemented as a Region Of Interest (ROI) Selector.