A method of annotating audio-visual data is disclosed. The method includes detecting a plurality of facial expressions in an audience based on a stimulus, determining an emotional response to the stimulus based on the facial expressions and generating at least one annotation of the stimulus based on the determined emotional response.