Patent attributes
In one aspect, the present disclosure relates to a method which, in one embodiment, includes: receiving video data for a first video; deconstructing the video data of the first video into a plurality of context windows; performing, on each context window that includes an image frame, a video analytic function on the image frame to identify one or more characteristics of the context window; performing, on each context window that includes an audio frame, a video analytic function on the audio frame to identify one or more characteristics of the context window; generating, for each context windows, a respective local atomic unit comprising attributes derived from the identified one or more characteristics of the respective context window, to form a plurality of local atomic units; and generating a local graph representation of the first video, comprising a plurality of nodes corresponding to the plurality of local atomic units.