Patent attributes
An apparatus comprising: an input configured to receive at least two audio signals; a frequency domain transformer configured to transform the at least two audio signals into a frequency domain representation of the at least two signals; a spatial covariance processor configured to generate an observed spatial covariance matrix from the frequency domain representations of the at least two audio signals; a beamformer configured to generate a spatial covariance matrix model comprising at least one beamformer kernel; a matrix factorizer configured to generate a linear magnitude mode! of audio objects; to combine the spatial covariance matrix model and the linear magnitude model; and further configured to determine at least one combination parameter, such that the at least one parameter for the combination attempts to optimise the combination; and a separator configured to cluster the audio objects based on the at least one combination parameter to create separated audio sources.