Patent attributes
Methods and computer-readable media for repeated holdout validation include collecting independent data representing independent variables; collecting dependent data representing a dependent variable; correlating the independent data with the dependent data; creating a data set comprising the correlated independent and dependent data; generating a plurality of unique seeds; creating a plurality of training sets and a plurality of validation sets; associating each training set with a single validation set; training the neural network a plurality of times with the training sets and seeds to create a plurality of models; calculating accuracy metric values for the models using the validation sets associated with the training sets used to create respective models; performing a statistical analysis of the accuracy metric values; and ranking the independent variables by a strength of correlation of individual independent variables with the dependent variable, when a metric of the statistical analysis exceeds a threshold.