Patent attributes
Various types of objects or occurrences can be automatically detected in input media being processed using a transcoder. The media content can be analyzed to determine various transitions, such as scene changes, which provide insight into useful locations for performing object recognition. Representative frames subsequent a transition are analyzed to determine whether they are appropriate for image analysis, using factors such as amount of motion, brightness, color, or pixel disparity within the frame. If a representative frame meets the various criteria, that frame is sent to an object recognition service for analysis. The output of the service can be a set of object tags that provide information identifying the object and its location in the media. The output tags can be encoded into the output video or stored to an associated metadata file, among other such options.