A controllable grounded response generation framework includes a machine learning model, a grounding interface, and a control interface. The machine learning model is trained to output computer-generated text based on input text. The grounding interface is useable by the machine learning model to access a grounding source including information related to the input text. The control interface is useable by the machine learning model to recognize a control signal. The machine learning model is configured to include information from the grounding source in the computer-generated text and focus the computer-generated text based on the control signal.
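
The arrangement of the three components can be illustrated with a minimal sketch. It is not the framework's actual implementation: every name in it (`GroundingInterface`, `ControlInterface`, `Framework`, `toy_model`, `lookup`, `recognize`, `respond`) is hypothetical, the grounding source is assumed to be a plain list of sentences, the control signal is assumed to be a short phrase, and a simple filtering function stands in for the trained machine learning model.

```python
from dataclasses import dataclass
from typing import Callable, List, Set


def _keywords(text: str) -> Set[str]:
    """Lower-cased words longer than three characters (a crude stop-word filter)."""
    stripped = (w.strip(".,?!") for w in text.split())
    return {w.lower() for w in stripped if len(w) > 3}


@dataclass
class GroundingInterface:
    """Hypothetical wrapper around a grounding source (assumed: a list of sentences)."""
    sentences: List[str]

    def lookup(self, input_text: str) -> List[str]:
        # Return grounding sentences that share a keyword with the input text.
        wanted = _keywords(input_text)
        return [s for s in self.sentences if wanted & _keywords(s)]


@dataclass
class ControlInterface:
    """Hypothetical wrapper that normalizes a control signal (assumed: a phrase)."""

    def recognize(self, control_signal: str) -> str:
        return control_signal.strip().lower()


def toy_model(input_text: str, grounding: List[str], control: str) -> str:
    # Stand-in for a trained model: keep only the grounded sentences that
    # mention the control phrase, so the output is grounded and focused.
    focused = [s for s in grounding if control in s.lower()]
    return " ".join(focused) if focused else "(no grounded content matched)"


@dataclass
class Framework:
    """Wires the model to the grounding and control interfaces."""
    model: Callable[[str, List[str], str], str]
    grounding: GroundingInterface
    control: ControlInterface

    def respond(self, input_text: str, control_signal: str) -> str:
        facts = self.grounding.lookup(input_text)        # information related to the input
        phrase = self.control.recognize(control_signal)  # what the output should focus on
        return self.model(input_text, facts, phrase)


# Usage with made-up example data.
framework = Framework(
    model=toy_model,
    grounding=GroundingInterface([
        "The Eiffel Tower is 330 metres tall.",
        "The Eiffel Tower opened in 1889.",
        "The Louvre is in Paris.",
    ]),
    control=ControlInterface(),
)
reply = framework.respond("Tell me about the Eiffel Tower", "opened in")
print(reply)  # The Eiffel Tower opened in 1889.
```

In this sketch the same input text yields a different response when the control signal changes (for example, `"tall"` instead of `"opened in"`), while the content of the response still comes only from the grounding source. A real system would replace `toy_model` with a trained text generator.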