Patent attributes
A spatiotemporal action detection method includes: performing object detection on all frames of a sample video to obtain a candidate object set; calculating the optical flow between all frames of the sample video to obtain a motion set; constructing a spatiotemporal convolution-deconvolution network augmented with an object attention mechanism and a motion attention mechanism; performing spatiotemporal convolution on each time segment of the sample video and then adding a corresponding sparse variable and sparse constraint to obtain a network structure S; training the network structure S with an objective function based on a cross-entropy classification loss and a sparse-constraint loss; and calculating the action category and the sparse coefficient corresponding to each time segment of a sampled test video to obtain the spatiotemporal location of the object action.
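
The training objective and the use of per-segment sparse coefficients can be illustrated with a minimal sketch. The abstract does not give the exact form of either term, so everything below is an assumption made for illustration: the sparse constraint is taken to be an L1 penalty, the video-level prediction is taken to be a sparse-coefficient-weighted sum of per-segment class scores, and the names `objective`, `active_segments`, `lam`, and `threshold` are hypothetical.

```python
import numpy as np

def cross_entropy(logits, label):
    """Cross-entropy of a vector of class scores against an integer label."""
    shifted = logits - np.max(logits)  # subtract the max for numerical stability
    log_probs = shifted - np.log(np.sum(np.exp(shifted)))
    return -log_probs[label]

def objective(segment_logits, sparse_coeffs, label, lam=0.1):
    """Classification loss plus a sparsity penalty (both forms are assumptions).

    segment_logits: array of shape (num_segments, num_classes)
    sparse_coeffs:  array of shape (num_segments,), one sparse variable per segment
    lam:            hypothetical weight of the sparse-constraint term
    """
    # Assumed pooling: weight each segment's scores by its sparse coefficient.
    video_logits = np.sum(segment_logits * sparse_coeffs[:, None], axis=0)
    classification_loss = cross_entropy(video_logits, label)
    sparsity_loss = np.sum(np.abs(sparse_coeffs))  # assumed L1 sparse constraint
    return classification_loss + lam * sparsity_loss

def active_segments(sparse_coeffs, threshold=0.5):
    """Indices of segments whose coefficient magnitude exceeds a hypothetical threshold."""
    return [i for i, c in enumerate(sparse_coeffs) if abs(c) > threshold]
```

Under these assumptions, segments whose sparse coefficients remain large after training are the ones treated as containing the action, which is one way the temporal part of the location could be read off at test time.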