Patent attributes
Use of a deduplication module remote from a storage system for storage of data subject to deduplication by the remote deduplication module. The storage system may generate a hash of target data and send the hash of the target data to the remote deduplication module. The remote deduplication module may access a hash database corresponding to the storage system and apply a deduplication algorithm on the hash. In turn, information regarding a deduplication result may be returned to the storage system which may either store the unique target data or update a mapping between the target data and duplicate existing data in a persistent storage of the storage device. The remote deduplication module may be executed by scalable cloud resources such that the computational resources dedicated to the deduplication module may be scaled.