Patent attributes
In one aspect, a playback device includes a command-keyword engine having a local natural language unit (NLU). The playback device detects, via the command-keyword engine, a first command keyword in voice input of sound detected by one or more microphones of the playback device. The playback device determines whether the sound input data includes a keyword from a first predetermined library of keywords via a local natural language unit (NLU). The playback device transmits the input sound data to a second playback device over a local area network, the second playback device employing a second local NLU with a second predetermined library of keywords. The playback device receives a response from the second playback device and performs an action based on an intent determined by at least one of the first NLU or the second NLU according to the keywords in the voice input.