In one implementation, a network device receives a series of video frames spanning a series time period from a network. The time period may be static of a specific duration or identified dynamically based on the timing of the video frames. The network device decodes the series of video frames and stores the decoded series of data frames in memory along with associated timestamps. A pause phenomenon in excess of expected delay is identified between two of the decoded video frames. The network device forwards or displays the series of decoded frames such that each successive pair of frames in the series of decoded frames are spaced by a resultant delay period that is less than a duration of the pause phenomenon.