A system and method are described for smart noise cancellation. In some implementations, the system may receive audio data describing an audio signal and may use the audio data to determine one or more frames of the audio signal. The system may perform spectral analysis, which may include a signal-to-noise ratio estimate, on the one or more frames. In some instances, the system may identify a noise frame from among the one or more frames based on the spectral analysis, and may reduce noise in the one or more frames based on the noise frame and the spectral analysis.
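
As an illustration only, the flow summarized above (framing, spectral analysis with a signal-to-noise ratio estimate, noise-frame identification, and noise reduction) might be sketched as follows. The abstract does not specify any of these details, so everything here is an assumption: the function names (`split_frames`, `reduce_noise`), the non-overlapping frames, the choice of the lowest-power frame as the noise frame, and the spectral-subtraction-style gain with a floor are hypothetical stand-ins, not the described system's actual method.

```python
import cmath

def dft(x):
    """Naive discrete Fourier transform (O(n^2)), adequate for short frames."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]

def idft(spectrum):
    """Naive inverse DFT, returning the real part of each sample."""
    n = len(spectrum)
    return [(sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n)
                 for k in range(n)) / n).real
            for t in range(n)]

def split_frames(signal, frame_len):
    """Split the signal into non-overlapping frames (assumed framing scheme).
    Trailing samples that do not fill a whole frame are dropped."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, frame_len)]

def reduce_noise(signal, frame_len=32, gain_floor=0.05):
    """Return (denoised_signal, noise_frame_index) for a list of samples."""
    frames = split_frames(signal, frame_len)
    # Spectral analysis: per-frame spectrum and per-bin power.
    spectra = [dft(frame) for frame in frames]
    powers = [[abs(c) ** 2 for c in spectrum] for spectrum in spectra]
    # Assumed heuristic: treat the frame with the lowest total power as noise.
    noise_idx = min(range(len(frames)), key=lambda i: sum(powers[i]))
    noise_power = powers[noise_idx]
    denoised = []
    for spectrum, power in zip(spectra, powers):
        cleaned = []
        for c, p, n in zip(spectrum, power, noise_power):
            # Per-bin signal-to-noise ratio estimate against the noise frame.
            snr = p / n if n > 0 else float("inf")
            # Spectral-subtraction-style gain, clamped to a floor.
            gain = max(1.0 - 1.0 / snr, gain_floor) if snr > 0 else gain_floor
            cleaned.append(c * gain)
        denoised.extend(idft(cleaned))
    return denoised, noise_idx
```

In this sketch, bins whose power is close to that of the noise frame are attenuated toward `gain_floor`, while bins with a high estimated signal-to-noise ratio pass nearly unchanged. A practical implementation would more likely use overlapping windowed frames and a fast Fourier transform.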