Patent attributes
Embodiments of the present invention relate to a method, system, and computer program product for compacting data in a distributed storage system. In some embodiments, a method is disclosed. According to the method, first information about historical resource usage of a data node and second information about a historical data compaction performed at the data node are obtained. The historical data compaction compacted data files associated with a first database table into one data file. A configuration for a data compaction to be performed on a plurality of data files associated with a second database table is generated based on the first and second information. The data compaction is performed during a future time period. In other embodiments, a system and a computer program product are disclosed.