Patent attributes
Devices, systems and methods are disclosed for improving playback of video data and generating a video summary. For example, annotation data may be generated for individual video frames included in the video data to indicate content present in those frames, such as faces, objects, pets, speech or the like. A video summary may be determined by calculating a priority metric for individual video frames based on the annotation data. In response to input indicating a face and a period of time, a video summary can be generated that includes video segments focused on the face within the period of time. The video summary may be directed to multiple faces and/or objects based on the annotation data.
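
The flow described above (per-frame annotation data, a priority metric, and selection of segments for a given face and time period) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the disclosure: the `FrameAnnotation` fields, the weights in `priority`, and the `threshold` and `max_gap` parameters of `summarize` are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class FrameAnnotation:
    """Hypothetical annotation record for a single video frame."""
    timestamp: float                          # seconds from the start of the video
    faces: set = field(default_factory=set)   # identifiers of faces in the frame
    objects: set = field(default_factory=set) # labels of objects in the frame
    has_speech: bool = False                  # whether speech was detected


def priority(frame, target_face):
    """Illustrative priority metric; the weights are arbitrary assumptions."""
    if target_face not in frame.faces:
        return 0.0
    score = 1.0                          # base score: the requested face is present
    if frame.has_speech:
        score += 0.5                     # assumed bonus for speech
    score += 0.1 * len(frame.objects)    # assumed small bonus per annotated object
    return score


def summarize(frames, target_face, start, end, threshold=1.0, max_gap=1.0):
    """Return (start, end) segments within [start, end] that focus on target_face.

    Frames scoring at least `threshold` are kept, and kept frames that are no
    more than `max_gap` seconds apart are merged into one segment.
    """
    selected = [
        f.timestamp
        for f in sorted(frames, key=lambda f: f.timestamp)
        if start <= f.timestamp <= end and priority(f, target_face) >= threshold
    ]
    segments = []
    for t in selected:
        if segments and t - segments[-1][1] <= max_gap:
            segments[-1][1] = t          # extend the current segment
        else:
            segments.append([t, t])      # start a new segment
    return [tuple(s) for s in segments]
```

Calling `summarize(frames, "alice", 1.0, 6.0)` on a list of annotated frames would return the time ranges in which the hypothetical face identifier `"alice"` appears between seconds 1 and 6. Extending the sketch to multiple faces or objects would mainly mean changing `priority` to accept a set of targets.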