Patent attributes
Embodiments are directed to controlling playback of recordings. The recording can comprise an audio recording, audio/visual recording, voicemail message, or other recording having an audio component. According to one embodiment, a method can comprise capturing an audio recording of speech of at least one person and determining, a context for each of a plurality of portions of the audio recording based on natural language processing of the audio recording. One or more transition points between the portions of the audio recording can be identified. Each transition point can indicate a change in the determined context between the portions. A playback interface providing a representation of the audio recording and each of the identified transition points can be presented and the audio recording can be played based on input received through the playback interface.