Patent attributes
One embodiment of the present invention provides a system that characterizes computer system workloads. During operation, the system collects metrics for a number of workloads of interest as the workloads of interest execute on a computer system. Next, the system uses the collected metrics to build a statistical regression model, wherein the statistical regression model uses a performance indicator as a response, and uses the metrics as predictors. The system then defines a distance metric between workloads, wherein the distance between two workloads is a function of the differences between metric values for the two workloads. Furthermore, these differences are weighted by corresponding coefficients for the metric values in the statistical regression model.