Patent attributes
A method, system, and program product for data deduplication in a replication environment, the replication environment having a production site, a splitter, and a replication site, wherein the replication site has a journal, comprising determining a digest for each chunk of data of a set of data chunks, determining for each chunk whether the digest is in an index on the production site, determining which offsets are to be evicted from the cache on the replication site, replacing the chunks in the set of chunks that are in the index with an offset, and transmitting the set of chunks, offsets, and an eviction list to the replication site.
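
The steps recited above can be illustrated with a minimal Python sketch. It is not the claimed implementation: the class names `ProductionSite` and `ReplicationSite`, the use of SHA-256 as the digest, the oldest-first eviction policy, the fixed `capacity`, and the convention that offsets are simple counters assigned in arrival order are all assumptions made for illustration. The sketch also assumes each batch holds no more chunks than `capacity`, and it leaves out the splitter and the journal entirely.

```python
import hashlib


class ProductionSite:
    """Hypothetical sender: keeps an index of digest -> offset in the replica cache."""

    def __init__(self, capacity):
        self.capacity = capacity  # assumed maximum number of cached chunks
        self.index = {}           # digest -> offset (dicts keep insertion order)
        self.next_offset = 0      # assumed: offsets are assigned sequentially

    def prepare(self, chunks):
        # Decide evictions up front so the cache has room for this whole batch.
        evictions = []
        while self.index and len(self.index) + len(chunks) > self.capacity:
            oldest_digest = next(iter(self.index))
            evictions.append(self.index.pop(oldest_digest))

        payload = []
        for chunk in chunks:
            d = hashlib.sha256(chunk).digest()
            if d in self.index:
                # Digest already indexed: send the offset instead of the data.
                payload.append(("offset", self.index[d]))
            else:
                # New chunk: record where the replica will store it, send the data.
                self.index[d] = self.next_offset
                self.next_offset += 1
                payload.append(("chunk", chunk))
        return payload, evictions


class ReplicationSite:
    """Hypothetical receiver: a cache of offset -> chunk mirroring the sender's index."""

    def __init__(self):
        self.cache = {}
        self.next_offset = 0

    def apply(self, payload, evictions):
        # Apply the eviction list first, then rebuild the original chunk sequence.
        for offset in evictions:
            del self.cache[offset]
        restored = []
        for kind, value in payload:
            if kind == "chunk":
                self.cache[self.next_offset] = value
                self.next_offset += 1
                restored.append(value)
            else:
                restored.append(self.cache[value])
        return restored
```

In this sketch, calling `prepare` on a batch and passing the result to `apply` on the other side returns the original chunks in order. Deciding evictions before the index lookup means that a chunk whose entry was just evicted is treated as new and sent in full, so the receiver never has to resolve an offset it has already dropped.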