An amplifier-embedded video surveillance IP speaker system is disclosed. The system includes an IP video device, an IP audio device, and a sensor. Audio data from a monitoring agent using a remote user terminal is transmitted to an amplifier-embedded IP speaker having an assigned IP address and is output through that speaker. Alternatively, a remote control command is transmitted to the amplifier-embedded IP speaker, causing it to output a warning sound.
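
The two delivery paths described above (agent audio versus a remote control command, each addressed to a speaker by its assigned IP address) can be illustrated with a minimal in-memory sketch. It is not the disclosed implementation: the names `AudioMessage`, `WarningCommand`, `IPSpeaker`, and `Network`, the `"siren"` sound identifier, and the address `192.0.2.10` (a documentation-only address) are all assumptions made for illustration, and no real network transport or audio playback is modeled.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Union


@dataclass(frozen=True)
class AudioMessage:
    """Hypothetical: voice audio captured at the remote user terminal."""
    payload: bytes


@dataclass(frozen=True)
class WarningCommand:
    """Hypothetical: remote control command requesting a warning sound."""
    sound_id: str = "siren"


Message = Union[AudioMessage, WarningCommand]


@dataclass
class IPSpeaker:
    """Stand-in for an amplifier-embedded IP speaker with an assigned address."""
    ip_address: str
    played: List[str] = field(default_factory=list)

    def handle(self, message: Message) -> None:
        # Record what would be output instead of driving real hardware.
        if isinstance(message, AudioMessage):
            self.played.append(f"voice:{len(message.payload)} bytes")
        elif isinstance(message, WarningCommand):
            self.played.append(f"warning:{message.sound_id}")
        else:
            raise TypeError(f"unsupported message: {message!r}")


class Network:
    """In-memory stand-in for the IP network; routes messages by address."""

    def __init__(self) -> None:
        self._speakers: Dict[str, IPSpeaker] = {}

    def register(self, speaker: IPSpeaker) -> None:
        self._speakers[speaker.ip_address] = speaker

    def send(self, ip_address: str, message: Message) -> None:
        speaker = self._speakers.get(ip_address)
        if speaker is None:
            raise LookupError(f"no speaker registered at {ip_address}")
        speaker.handle(message)


# Usage: one speaker receives agent audio, then a warning command.
network = Network()
speaker = IPSpeaker(ip_address="192.0.2.10")
network.register(speaker)
network.send("192.0.2.10", AudioMessage(payload=b"\x00\x01\x02\x03"))
network.send("192.0.2.10", WarningCommand())
print(speaker.played)
```

In this sketch the speaker decides what to output from the type of message it receives, which mirrors the abstract's distinction between relayed agent audio and a command that triggers a locally produced warning sound.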