Patent attributes
Mechanisms are provided to implement a patient summary generation engine with deduplication of instances of medical concepts. The patient summary generation engine parses a patient electronic medical record (EMR) to extract a plurality of instances of a medical concept, at least two of which utilize different representations of the medical concept. The patient summary generation engine performs a similarity analysis between each of the instances of a medical concept to thereby calculate, for a plurality of combinations of instances of the medical concept, a similarity metric value. The patient summary generation engine clusters the instances of the medical concept based on the calculated similarity metric values for each combination of instances in the plurality of combinations of instances of the medical concept to thereby generate one or more clusters, and select a representative instance of the medical concept from each cluster in the one or more clusters. The patient summary generation engine generates a summary output of the patient EMR comprising the selected representative instances of the medical concept from each cluster.