Embodiments of the system include data set analytics to identify one or more data sets utilized by a workflow. The data set analytics identifies upstream data sets referenced by the data sets utilized by the workflow. The set of all data sets relevant to the workflow, namely the data sets utilized directly and their upstream data sets, is considered the applicable data sets and is analyzed. The data set analytics determines a usage pattern of each of the applicable data sets by the workflow, and identifies one or more data quality assertions for each of the applicable data sets based on the usage pattern. The data set analytics further performs a quality evaluation of the applicable data sets by applying the data quality assertions to the applicable data sets used by the workflow.
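By way of illustration only, the flow described above may be sketched as follows. This is a minimal sketch under stated assumptions, not a definitive implementation: the names `DataSet`, `applicable_data_sets`, `assertions_for`, and `evaluate`, the usage kinds `"join_key"` and `"aggregate"`, and the specific assertions chosen for each kind are all hypothetical. The usage pattern is supplied here as an input mapping, whereas an embodiment would determine it by analyzing the workflow.

```python
from dataclasses import dataclass, field


@dataclass
class DataSet:
    """Hypothetical in-memory data set: rows plus names of referenced upstream data sets."""
    name: str
    rows: list
    upstream: list = field(default_factory=list)


def applicable_data_sets(used, catalog):
    """Return the data sets used by the workflow plus all transitively upstream data sets."""
    seen = []
    stack = list(used)
    while stack:
        name = stack.pop()
        if name in seen:
            continue
        seen.append(name)
        stack.extend(catalog[name].upstream)
    return seen


def assertions_for(usage):
    """Map a usage pattern (column -> usage kind) to labeled data quality assertions.

    The kinds and the assertions derived from them are assumptions for illustration.
    """
    checks = {}
    for column, kind in usage.items():
        if kind == "join_key":
            checks[f"{column} is non-null"] = (
                lambda rows, c=column: all(r.get(c) is not None for r in rows)
            )
            checks[f"{column} is unique"] = (
                lambda rows, c=column: len({r.get(c) for r in rows}) == len(rows)
            )
        elif kind == "aggregate":
            checks[f"{column} is numeric"] = (
                lambda rows, c=column: all(isinstance(r.get(c), (int, float)) for r in rows)
            )
    return checks


def evaluate(used, catalog, usage_patterns):
    """Apply the assertions derived from each usage pattern to every applicable data set."""
    report = {}
    for name in applicable_data_sets(used, catalog):
        checks = assertions_for(usage_patterns.get(name, {}))
        report[name] = {label: check(catalog[name].rows) for label, check in checks.items()}
    return report


# Hypothetical example: the workflow reads "orders", which references "customers".
catalog = {
    "orders": DataSet(
        "orders",
        [
            {"id": 1, "customer_id": 10, "amount": 5.0},
            {"id": 2, "customer_id": None, "amount": 7.5},
        ],
        upstream=["customers"],
    ),
    "customers": DataSet("customers", [{"customer_id": 10}, {"customer_id": 11}]),
}
usage_patterns = {
    "orders": {"customer_id": "join_key", "amount": "aggregate"},
    "customers": {"customer_id": "join_key"},
}
report = evaluate(["orders"], catalog, usage_patterns)
```

In this sketch the upstream data set "customers" is evaluated even though the workflow names only "orders", and the assertions applied to each data set depend on how the workflow uses its columns rather than on a fixed rule set.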