Using voice recognition, a user can interact with a companion application to control a video game from a mobile device. Advantageously, the user can interact with the companion application even when the video game itself is unavailable, for example because of the user's location. Moreover, machine learning may be used to facilitate the generation of voice responses to user utterances, where the responses are those predicted to improve or maintain the user's level of engagement with the companion application or its corresponding video game.
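
As an illustration only, the engagement-driven choice of a voice response might be sketched as follows. The abstract does not specify a model, feature set, or selection rule, so everything named here is an assumption: the `Candidate` type, the three-element feature vector, the fixed `WEIGHTS` and `BIAS` standing in for a trained model, and the rule of picking the candidate with the highest predicted engagement score.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Candidate:
    """A possible voice response to a user utterance (hypothetical type)."""
    text: str
    features: List[float]  # hypothetical features describing response and user context


# Hypothetical parameters standing in for a trained prediction model.
WEIGHTS = [1.2, -0.8, 0.5]
BIAS = -0.1


def predicted_engagement(features: Sequence[float]) -> float:
    """Return a score in (0, 1) from a toy logistic model (an assumption)."""
    z = BIAS + sum(w * x for w, x in zip(WEIGHTS, features))
    return 1.0 / (1.0 + math.exp(-z))


def select_response(candidates: Sequence[Candidate]) -> Candidate:
    """Pick the candidate whose predicted engagement score is highest."""
    if not candidates:
        raise ValueError("at least one candidate response is required")
    return max(candidates, key=lambda c: predicted_engagement(c.features))


if __name__ == "__main__":
    options = [
        Candidate("Your base was upgraded. Want to start another build?", [1.0, 0.0, 0.0]),
        Candidate("Done.", [0.0, 1.0, 0.0]),
    ]
    print(select_response(options).text)
```

In a real system the score would come from a model trained on observed user behavior rather than from hand-set weights; the sketch shows only the shape of the selection step.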