Patent attributes
Embodiments of the present invention train multiple Perception models to predict contextual metadata (tags) with respect to target content items. Context is extracted from content items and associations are generated among the Perception models, such that individual Perceptions trigger one another based on the extracted context to generate a more robust set of contextual metadata. A Perception Identifier predicts core tags that make coarse distinctions among content items at relatively higher levels of abstraction, and also triggers other Perception models to predict additional perception tags at lower levels of abstraction. A Dense Classifier identifies sub-content items at various levels of abstraction and facilitates the iterative generation of additional dense tags across integrated Perceptions. Class-specific thresholds are generated for the individual classes of each Perception to address the inherent sampling bias that results from the varying number and quality of training samples (across different classes of content items) available to train each Perception.
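By way of non-limiting illustration, the triggering of associated Perceptions and the use of class-specific thresholds may be sketched as follows. The sketch is a toy, not the claimed implementation: each Perception is modeled as a plain function returning per-class confidence scores, and the names `best_threshold`, `apply_thresholds`, `tag_item`, `associations`, and the example classes are all hypothetical. The abstract does not state how the class-specific thresholds are generated; choosing the threshold that maximizes F1 on held-out samples is an assumption made here for concreteness.

```python
from typing import Callable, Dict, List, Set

Scores = Dict[str, float]             # class name -> confidence score
Perception = Callable[[str], Scores]  # hypothetical stand-in for a trained model


def best_threshold(scores: List[float], labels: List[bool]) -> float:
    """Pick, for ONE class, the score cutoff that maximizes F1 on held-out data.

    Assumed criterion: the source only says thresholds are class-specific.
    """
    best_t, best_f1 = 0.5, -1.0
    for t in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if s >= t and y)
        fp = sum(1 for s, y in zip(scores, labels) if s >= t and not y)
        fn = sum(1 for s, y in zip(scores, labels) if s < t and y)
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        if f1 > best_f1:  # strict '>' keeps the lowest cutoff among ties
            best_t, best_f1 = t, f1
    return best_t


def apply_thresholds(scores: Scores, thresholds: Dict[str, float],
                     default: float = 0.5) -> Set[str]:
    """Keep a class only if its score meets that class's own threshold."""
    return {c for c, s in scores.items() if s >= thresholds.get(c, default)}


def tag_item(item: str,
             identifier: Perception,
             perceptions: Dict[str, Perception],
             associations: Dict[str, List[str]],
             thresholds: Dict[str, Dict[str, float]]) -> Set[str]:
    """Predict core tags, then trigger the Perceptions associated with them."""
    core_tags = apply_thresholds(identifier(item), thresholds.get("identifier", {}))
    tags = set(core_tags)
    for core in sorted(core_tags):
        for name in associations.get(core, []):
            scores = perceptions[name](item)
            tags |= apply_thresholds(scores, thresholds.get(name, {}))
    return tags


# Toy models with fixed scores (illustrative values only).
def identifier(item: str) -> Scores:
    return {"sports": 0.9, "fashion": 0.2}


def teams(item: str) -> Scores:
    return {"team_a": 0.35, "team_b": 0.6}


perceptions = {"teams": teams}
associations = {"sports": ["teams"]}  # core tag -> Perceptions it triggers
thresholds = {
    "identifier": {"sports": 0.5, "fashion": 0.5},
    "teams": {"team_a": 0.3, "team_b": 0.7},  # differ per class
}

tags = tag_item("frame_001", identifier, perceptions, associations, thresholds)
print(sorted(tags))  # ['sports', 'team_a']
```

In this toy run, the core tag `sports` triggers the hypothetical `teams` Perception. The class `team_a` is accepted at a score of 0.35 because its threshold is 0.3, while `team_b` is rejected at 0.6 because its threshold is 0.7, which illustrates how a single global cutoff would treat the two classes differently from per-class cutoffs.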