Patent attributes
A method comprises obtaining, by a computing system, first audio data representing one or more initial utterances during an interactive voice session with an interactive voice system; generating, by the computing system, based on the first audio data, a prediction regarding whether a subsequent utterance of a user during the interactive voice session will contain sensitive information, the subsequent utterance following the one or more initial utterances in time; obtaining, by the computing system, second audio data representing the subsequent utterance; determining, by the computing system, based on the prediction, whether to transmit the second audio data; and based on a determination not to transmit the second audio data: replacing, by the computing system, the second audio data with third audio data that is based on a voice of the user; and transmitting, by the computing system, the third audio data.