Patent attributes
An object detector includes an input interface to accept a sequence of video frames, a memory to store a neural network trained to detect objects in the video frames, a processor to process each video frame sequentially with the neural network to detect objects in the sequence of video frames, and an output interface to output the object detection information. The neural network includes a first subnetwork, a second subnetwork, and a third subnetwork. The first subnetwork receives as an input a video frame and outputs a feature map of the video frame. The second subnetwork is a recurrent neural network that takes the feature map as an input and outputs a temporal feature map. The third subnetwork takes the temporal feature map as an input and outputs object detection information.