Patent attributes
Techniques are provided for caching of machine learning model training parameters. One method comprises training a machine learning model using a given training dataset; and caching a parameter of the machine learning model from the training with the given training dataset. The cached parameter of the machine learning model is used for a subsequent training of the machine learning model. The caching may be performed after each of multiple iterations of the training of the machine learning model. A given cached iteration of the training of the machine learning model may be identified using a key based on: (i) a hash of the given training dataset, (ii) a hash of the machine learning model parameter, and/or (iii) hyperparameters of the machine learning model. The caching of a given iteration of the training of the machine learning model may occur when the given iteration is not found in a cache memory.
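The keyed, per-iteration caching described above can be illustrated with a minimal Python sketch. It is not the claimed implementation, only one possible reading under stated assumptions: the model is a linear regression trained by gradient descent, the cache memory is a plain in-memory dict, the key combines SHA-256 digests of the dataset, of the parameters entering an iteration, and of the JSON-serialized hyperparameters, and the cached value is the parameter vector produced by that iteration. The names make_key and train and the hyperparameter "lr" are hypothetical and do not come from the source.

```python
import hashlib
import json

import numpy as np


def _digest(*parts: bytes) -> str:
    """SHA-256 hex digest over the concatenation of several byte strings."""
    h = hashlib.sha256()
    for part in parts:
        h.update(part)
    return h.hexdigest()


def make_key(X, y, params, hyper):
    """Hypothetical cache key: (i) dataset hash, (ii) parameter hash, (iii) hyperparameters."""
    data_hash = _digest(X.tobytes(), y.tobytes())
    param_hash = _digest(params.tobytes())
    hyper_repr = json.dumps(hyper, sort_keys=True)  # stable ordering of hyperparameters
    return _digest(data_hash.encode(), param_hash.encode(), hyper_repr.encode())


def train(X, y, hyper, cache, iterations):
    """Gradient descent on a linear model, consulting the cache at every iteration.

    Returns the final parameters and the number of cache hits.
    """
    w = np.zeros(X.shape[1])
    hits = 0
    for _ in range(iterations):
        # Assumption: the key uses the parameters entering this iteration.
        key = make_key(X, y, w, hyper)
        if key in cache:
            # Iteration already cached: reuse its result instead of recomputing.
            w = cache[key].copy()
            hits += 1
        else:
            # Iteration not found in the cache: compute it, then cache the result.
            grad = 2.0 * X.T @ (X @ w - y) / len(y)
            w = w - hyper["lr"] * grad
            cache[key] = w.copy()
    return w, hits


rng = np.random.default_rng(0)
X = rng.normal(size=(20, 3))
y = X @ np.array([1.0, -2.0, 0.5])
hyper = {"lr": 0.1}
cache = {}

w_first, hits_first = train(X, y, hyper, cache, iterations=5)
# A subsequent, longer training run reuses the five cached iterations.
w_second, hits_second = train(X, y, hyper, cache, iterations=8)
print(hits_first, hits_second, len(cache))
```

In this sketch, changing the dataset or any hyperparameter changes the key, so stale entries are never reused. A production cache would likely also hash array shapes and dtypes and persist entries outside process memory; both are omitted here for brevity.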