Patent attributes
Techniques for identifying and correcting synchronization errors for a media file are described herein. A first file that includes a first set of words comprising lyrics of a media file may be maintained. One or more portions of the media file that represent vocal audio may be separated from other portions of the media file that represent instrumental audio by a computer system. A second file may be generated based at least in part on using an automated speech recognition on the separated one or more portions of the media file. The second file my include time stamps for a second set of words comprising the lyrics in the separated one or more portions of the media file. The first file may be modified with an offset time value that is determined by aligning the first set of words with the second set of words.