Patent attributes
This disclosure relates to generating a comprehensive set of synthetic utterances. An example system is configured to provide an input utterance to a plurality of synthetic utterance generation pipelines in parallel. Each of the plurality of synthetic utterance generation pipelines include one or more utterance synthesizers. For example, one or more pipelines may use a synthesizer chain that includes a plurality of synthesizers in parallel. The plurality of synthetic utterance generation pipelines generates synthetic utterances, which may be stored in a database after evaluating the similarity between the original input utterance and each resulting synthetic utterance. For example, a synthetic utterance may be retained if the cosine similarity between the input and synthetic utterances is less than a predetermined threshold. Additionally, the synthetic utterances may be fed back at input utterances based on the similarity evaluation and the feedback loop repeated until a desired number of utterances are generated.