Patent attributes
Described herein are systems and methods that search videos and other media content to identify items, objects, faces, or other entities within the media content. Detectors identify objects within media content by, for instance, detecting a predetermined set of visual features corresponding to the objects. Detectors configured to identify an object can be trained using a machine-learned model (e.g., a convolutional neural network) as applied to a set of example media content items that include the object. The systems provide user interfaces that allow users to review search results, pinpoint relevant portions of media content items where the identified objects are determined to be present, review detector performance and retrain detectors, provide search result feedback, and/or review video monitoring results and analytics.
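By way of non-limiting illustration, the pinpointing of relevant portions of a media content item can be sketched as follows. The sketch assumes that a detector has already produced one confidence score per video frame; the function name `find_segments`, the `threshold` parameter, and the fixed frame rate `fps` are hypothetical and do not appear in the description above. It is one possible approach, not necessarily the one used by the described systems.

```python
from typing import List, Tuple


def find_segments(
    scores: List[float], fps: float, threshold: float = 0.5
) -> List[Tuple[float, float]]:
    """Group consecutive frames whose detector score meets the threshold.

    Returns a list of (start_seconds, end_seconds) spans. All names and the
    thresholding rule are illustrative assumptions.
    """
    segments: List[Tuple[float, float]] = []
    start = None  # index of the first frame in the current span, if any

    for i, score in enumerate(scores):
        if score >= threshold:
            if start is None:
                start = i  # a new span begins at this frame
        elif start is not None:
            # The span ended on the previous frame; close it at frame i.
            segments.append((start / fps, i / fps))
            start = None

    if start is not None:
        # The object was still detected in the final frame.
        segments.append((start / fps, len(scores) / fps))

    return segments


if __name__ == "__main__":
    # Hypothetical per-frame scores at 1 frame per second.
    print(find_segments([0.1, 0.9, 0.8, 0.2, 0.7], fps=1.0))
```

A user interface could then present each returned span as a selectable portion of the media content item in which the object is determined to be present.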