A method is disclosed for estimating the delay between an original signal and a corresponding captured signal. Each signal is transformed into spectral descriptors, which are buffered as two sets, and a similarity measure between the two sets is used to estimate the delay. The method advantageously provides delay estimation that remains robust to inconsistent delays and adverse spectral distortions.
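
By way of non-limiting illustration, the following Python sketch shows one way such a descriptor-based delay estimate could be arranged. It is not taken from the disclosure: the binary band-energy descriptor, the bit-agreement similarity score, the frame and band sizes, and the function names `spectral_descriptors` and `estimate_delay_frames` are all assumptions chosen for the example.

```python
import numpy as np


def spectral_descriptors(signal, frame_len=256, hop=128, n_bands=16):
    """Buffer one binary spectral descriptor per frame (assumed descriptor form).

    Each descriptor bit records whether a frequency band carries more energy
    than the band below it, which makes the descriptor insensitive to gain.
    """
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    descriptors = np.empty((n_frames, n_bands - 1), dtype=bool)
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len] * window
        power = np.abs(np.fft.rfft(frame)) ** 2
        # Group the FFT bins into contiguous bands and sum the power in each.
        energy = np.array([band.sum() for band in np.array_split(power, n_bands)])
        descriptors[i] = energy[1:] > energy[:-1]
    return descriptors


def estimate_delay_frames(ref_desc, cap_desc, max_lag):
    """Return (lag, score) for the lag, in frames, with the highest similarity.

    Similarity here is the fraction of matching descriptor bits over the
    overlapping frames (an assumed measure, not the disclosed one).
    """
    best_lag, best_score = 0, -1.0
    for lag in range(max_lag + 1):
        n = min(len(ref_desc), len(cap_desc) - lag)
        if n <= 0:
            break
        score = float(np.mean(ref_desc[:n] == cap_desc[lag : lag + n]))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag, best_score
```

In this sketch the delay is resolved only to the frame hop, so multiplying the returned lag by `hop` gives the delay in samples. Because the descriptor compares neighbouring bands rather than absolute levels, a broadband gain change in the captured signal leaves the score unchanged; stronger spectral distortions would lower the score without necessarily moving its peak.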