Patent attributes
Disclosed are a speech processing method and a speech processing apparatus in a 5G communication environment through speech processing by executing embedded artificial intelligence (AI) algorithms and/or machine learning algorithms. The speech processing method includes determining a temporary pause of reception of a first spoken utterance, outputting a first spoken response utterance as a result of speech recognition processing of a second spoken utterance received after the temporary pause, determining, as an extension of the first spoken utterance, a third spoken utterance that is received after outputting the first spoken response utterance, deleting, using a deep neural network model, a duplicate utterance part from a fourth spoken utterance that is obtained by combining the first and the third spoken utterance, and outputting a second spoken response utterance as a result of speech recognition processing of the fourth spoken utterance from which the duplicate utterance part has been deleted.