Aspects of the present disclosure relate to ignoring trigger words of a buffered media stream. A buffered media stream of media content is accessed in advance of playing the media stream. One or more trigger words in the media content of the buffered media stream are identified. A time stamp is generated for each of the one or more identified trigger words in relation to a play time of the media content of the buffered media stream. While the buffered media stream is played, a voice command device is instructed to ignore audio content of the buffered media stream based on the time stamp for each of the one or more identified trigger words.
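
The flow described above can be illustrated with a minimal Python sketch. It is not the disclosed implementation. It assumes the buffered stream has already been transcribed into words with start and end times relative to the play time of the media content; the names `Word`, `find_trigger_timestamps`, and `should_ignore`, the `margin` parameter, and the trigger word "computer" are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """One transcribed word of the buffered stream (hypothetical structure)."""
    text: str
    start: float  # seconds from the start of media play time
    end: float    # seconds from the start of media play time


def find_trigger_timestamps(words, trigger_words):
    """Return (start, end) time stamps for every trigger word in the transcript."""
    triggers = {t.lower() for t in trigger_words}
    return [
        (w.start, w.end)
        for w in words
        # Normalize case and strip trailing punctuation before matching.
        if w.text.lower().strip(".,!?") in triggers
    ]


def should_ignore(play_time, windows, margin=0.25):
    """True if the voice command device should ignore audio at play_time.

    margin (an assumed value, in seconds) widens each window slightly to
    tolerate small playback or clock drift.
    """
    return any(start - margin <= play_time <= end + margin
               for start, end in windows)


# Assumed transcript of the buffered stream, produced before playback begins.
transcript = [
    Word("hey", 11.8, 12.0),
    Word("Computer,", 12.0, 12.5),
    Word("play", 12.6, 12.9),
]

windows = find_trigger_timestamps(transcript, ["computer"])
```

During playback, a player could call `should_ignore(current_play_time, windows)` and forward the result to the voice command device. How that instruction is actually delivered to the device is outside the scope of this sketch.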