Various embodiments facilitate voice control of a receiving device, such as a set-top box. In one embodiment, a voice-enabled media presentation system (“VEMPS”) includes a receiving device and a remote-control device having an audio input device. The VEMPS is configured to obtain audio data via the audio input device, the audio data being received from a user and representing a spoken command to control the receiving device. The VEMPS is further configured to determine the spoken command by performing speech recognition on the obtained audio data, and to control the receiving device based on the determined command. This abstract is provided to comply with rules requiring an abstract, and it is submitted with the intention that it will not be used to interpret or limit the scope or meaning of the claims.
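
By way of illustration only, the three operations summarized above (obtaining audio data, determining the spoken command by speech recognition, and controlling the receiving device) may be sketched as follows. This is a minimal sketch under stated assumptions, not the disclosed implementation: the names `ReceivingDevice`, `fake_recognizer`, and `control_by_voice`, the command vocabulary, and the representation of audio data as bytes are all hypothetical, and the recognizer is a placeholder that merely decodes bytes as text rather than performing actual speech recognition.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ReceivingDevice:
    """Hypothetical stand-in for a receiving device such as a set-top box."""
    channel: int = 1
    muted: bool = False

    def execute(self, command: str) -> bool:
        """Apply a recognized command; return False if it is not supported."""
        if command == "channel up":
            self.channel += 1
        elif command == "channel down":
            self.channel = max(1, self.channel - 1)
        elif command == "mute":
            self.muted = not self.muted
        else:
            return False
        return True


def fake_recognizer(audio: bytes) -> str:
    """Placeholder for speech recognition (assumption): decodes bytes as UTF-8 text."""
    return audio.decode("utf-8", errors="ignore")


def control_by_voice(
    audio: bytes,
    recognize: Callable[[bytes], str],
    device: ReceivingDevice,
) -> Optional[str]:
    """Determine the spoken command from audio data and control the device.

    Returns the normalized command if the device accepted it, else None.
    """
    command = recognize(audio).strip().lower()
    return command if device.execute(command) else None
```

In this sketch, a call such as `control_by_voice(b"Channel Up", fake_recognizer, ReceivingDevice())` passes the audio data through the recognizer and forwards the resulting command to the device. The recognizer is supplied as a parameter so that the recognition step may be performed by any suitable component without changing the control logic.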