Patent attributes
A home assistant device captures voice signals uttered by users in the home and extracts vocal features from the captured recordings. The device also collects data about the current context in the home and requests from an aggregator the background model best adapted to that context. The home assistant device obtains this background model and uses it locally to perform speaker recognition. Home assistant devices from a plurality of homes contribute to a database of background models: each device aggregates vocal features, clusters them according to context, and computes background models for the different contexts. The aggregator then collects these background models, clusters them according to their contexts, and aggregates them in the database. Any home assistant device can then request from the aggregator the background model that best fits its current context, thus improving speaker recognition.
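The request step described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not taken from the abstract: the context is encoded as a small numeric vector, each stored background model is tagged with a context centroid, and "best adapted" is approximated by the smallest Euclidean distance. The names `BackgroundModel`, `Aggregator`, `add_model`, and `best_model_for` are hypothetical.

```python
from dataclasses import dataclass
from math import dist
from typing import List, Tuple

# Hypothetical context encoding: (people present, TV on as 0/1, hour of day).
Context = Tuple[float, ...]


@dataclass(frozen=True)
class BackgroundModel:
    """A background model tagged with the centroid of the contexts it was built from."""
    model_id: str
    context_centroid: Context


class Aggregator:
    """Stores background models and returns the one closest to a requested context."""

    def __init__(self) -> None:
        self._models: List[BackgroundModel] = []

    def add_model(self, model: BackgroundModel) -> None:
        self._models.append(model)

    def best_model_for(self, context: Context) -> BackgroundModel:
        # Assumption: closeness is plain Euclidean distance between context vectors.
        if not self._models:
            raise LookupError("no background models available")
        return min(self._models, key=lambda m: dist(m.context_centroid, context))


# Usage: a device reports its current context and receives the closest model.
aggregator = Aggregator()
aggregator.add_model(BackgroundModel("quiet_evening", (1.0, 0.0, 21.0)))
aggregator.add_model(BackgroundModel("busy_tv_on", (4.0, 1.0, 19.0)))

selected = aggregator.best_model_for((3.0, 1.0, 20.0))
print(selected.model_id)  # prints "busy_tv_on"
```

A real system would likely normalize the context features and use a distance suited to mixed categorical and numeric data; the unweighted Euclidean distance here only keeps the sketch short.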