A method for audio and video synchronization includes: receiving, at an audio player, audio data sent by a video player, wherein the audio data includes more than one audio sampling point; obtaining a network delay change between the audio player and the video player, wherein the network delay change refers to the time difference between the network delay first monitored and the network delay last monitored; determining a total amount of audio data to be adjusted according to the network delay change; adding audio sampling points to, or deleting audio sampling points from, the received audio data according to the total amount of audio data to be adjusted and a preset adjustment strategy, so that the audio player is synchronized with the video player; and playing the adjusted audio data.
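The adjustment steps can be illustrated with a minimal Python sketch. It is not the claimed implementation: the function names `samples_to_adjust` and `adjust_audio`, the millisecond units, the sign convention (a grown delay means samples are deleted, a shrunken delay means samples are added), and the evenly spaced "preset adjustment strategy" are all assumptions made for illustration, since the text above does not fix any of them.

```python
from typing import List


def samples_to_adjust(first_delay_ms: float, last_delay_ms: float,
                      sample_rate_hz: int) -> int:
    """Convert the network delay change into a signed sample count.

    Assumed convention: a positive result means samples should be deleted
    (delay grew, audio lags); a negative result means samples should be added.
    """
    delay_change_ms = last_delay_ms - first_delay_ms
    return round(delay_change_ms * sample_rate_hz / 1000)


def adjust_audio(samples: List[int], total_adjust: int) -> List[int]:
    """Delete or duplicate samples at evenly spaced positions.

    This even spreading is a hypothetical stand-in for the "preset
    adjustment strategy"; it avoids changing one long contiguous run.
    """
    count = abs(total_adjust)
    if count == 0 or not samples:
        return list(samples)
    # A single buffer cannot absorb more changes than it has samples.
    count = min(count, len(samples))
    # Integer arithmetic keeps the chosen indices distinct and in range.
    targets = {(i * len(samples)) // count for i in range(count)}
    out: List[int] = []
    for idx, sample in enumerate(samples):
        if idx in targets:
            if total_adjust > 0:
                continue          # delete this sampling point
            out.append(sample)    # add one extra copy of this sampling point
        out.append(sample)
    return out


if __name__ == "__main__":
    # Delay grew from 20 ms to 30 ms at an assumed 48 kHz sample rate.
    total = samples_to_adjust(20.0, 30.0, 48000)
    buffer = list(range(4800))    # placeholder sample values
    adjusted = adjust_audio(buffer, total)
    print(total, len(buffer), len(adjusted))
```

Spreading the changes across the buffer rather than cutting or padding one block is a design choice of this sketch, intended to keep each individual change small; a real player would likely also smooth or interpolate around the changed points.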