Embodiments for deduplication of snapshots data in a data storage by a processor. Input snapshot data is partitioned into input similarity units. An input similarity unit is deduplicated with a corresponding similarity unit of a previous snapshot. Deduplication coverage is examined. If a deduplication coverage threshold is not met, a similarity search is conducted and the input similarity unit is deduplicated with a found similarity unit.