Patent 11024299 was granted and assigned to Amazon on June, 2021 by the United States Patent and Trademark Office.
Systems, methods, and computer-readable media are disclosed for providing privacy and intent preserving redactions of text derived from utterance data. Certain embodiments provide new techniques for using MadLib-style replacements to replace one or more terms or phrases in a text string. Example methods may include receiving utterance data and determining a public portion and a private portion of the utterance data. Certain methods include determining a cluster of candidates having a same semantic context as the private portion and identifying from within the cluster of candidates a first candidate. Certain methods include determining a redacted utterance comprising the public portion of the utterance and the first candidate. Certain methods include providing the redacted utterance to downstream systems and processes.