Patent attributes
A method and system for providing data imbalance detection and validation for a trained ML model include receiving a request to perform data imbalance detection on the trained ML model, identifying a feature of a dataset associated with the trained ML model for which data imbalance detection is to be performed, receiving access to the dataset, receiving access to the trained ML model, examining at least one of the dataset or outcome data generated by the trained ML model to determine a distribution of the feature or a distribution of the outcome data, and determining whether the trained ML model exhibits data imbalance based at least in part on the distribution of the feature or the distribution of the outcome data.
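The examining and determining steps can be illustrated with a minimal sketch. It is not the claimed implementation: the function names `distribution` and `is_imbalanced`, the `min_ratio` threshold, the ratio-based test, and the sample values are all assumptions introduced here for illustration. The abstract does not specify how a distribution is turned into an imbalance decision.

```python
from collections import Counter


def distribution(values):
    """Return the share of each distinct value (shares sum to 1.0)."""
    counts = Counter(values)
    total = sum(counts.values())
    return {value: count / total for value, count in counts.items()}


def is_imbalanced(values, min_ratio=0.5):
    """Flag imbalance when the rarest value's share falls below
    min_ratio times the most common value's share.

    min_ratio is an illustrative threshold, not a value from the source.
    """
    shares = distribution(values)
    if len(shares) < 2:
        # With fewer than two distinct values there is nothing to compare;
        # this sketch treats that case as "not imbalanced".
        return False
    return min(shares.values()) / max(shares.values()) < min_ratio


# Hypothetical values of one identified feature in the dataset.
feature_values = ["A"] * 20 + ["B"] * 80
# Hypothetical outcome data generated by the trained ML model.
outcome_values = ["approve"] * 50 + ["deny"] * 50

feature_imbalanced = is_imbalanced(feature_values)
outcome_imbalanced = is_imbalanced(outcome_values)

# The determination may rest on either distribution ("at least in part").
model_exhibits_imbalance = feature_imbalanced or outcome_imbalanced
```

In this sketch the feature is split 20/80 and is flagged, while the outcome data is split evenly and is not. A practical system would likely choose the threshold and the comparison per feature rather than using one fixed ratio.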