US Patent 11893032 Measuring relevance of datasets to a data science model

A computer-implemented method, computer program product, and computer system for measuring the relevance of datasets to data science models. One or more servers perform the following steps: extract keywords from each data science model; determine first relative frequencies of the respective keywords in each data science model, for each source group in the data science models; extract keywords from each dataset; determine second relative frequencies of the respective keywords in each dataset, for each source group in the datasets; determine weights of the keywords; calculate first aggregated relevant scores of the respective keywords in each data science model, based on the first relative frequencies and the weights; and calculate second aggregated relevant scores of the respective keywords in each dataset, based on the second relative frequencies and the weights. The one or more servers then calculate the similarity between the vectors of the first and second aggregated relevant scores, based on a similarity measure between vectors.
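
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how keywords are extracted, how weights are chosen, how scores are aggregated across source groups, or which similarity measure is used. The sketch assumes a simple regex tokenizer, caller-supplied weights that default to 1.0, aggregation by summing weighted relative frequencies over source groups, and cosine similarity. All function names and the sample texts are hypothetical.

```python
import math
import re
from collections import Counter


def extract_keywords(text):
    # Assumption: a keyword is any lowercase alphabetic token of 3+ letters.
    return re.findall(r"[a-z]{3,}", text.lower())


def relative_frequencies(groups):
    # groups maps a source-group name (e.g. "code", "columns") to its text.
    # Returns {group: {keyword: count / total keywords in that group}}.
    result = {}
    for name, text in groups.items():
        counts = Counter(extract_keywords(text))
        total = sum(counts.values())
        result[name] = {k: c / total for k, c in counts.items()} if total else {}
    return result


def aggregated_scores(rel_freqs, weights):
    # Assumption: a keyword's aggregated score is its weight times the sum
    # of its relative frequencies over all source groups.
    scores = Counter()
    for freqs in rel_freqs.values():
        for keyword, freq in freqs.items():
            scores[keyword] += weights.get(keyword, 1.0) * freq
    return dict(scores)


def cosine_similarity(a, b):
    # Treat the two score dictionaries as sparse vectors over keywords.
    dot = sum(value * b.get(key, 0.0) for key, value in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def relevance(model_groups, dataset_groups, weights=None):
    # End-to-end: frequencies -> weighted aggregation -> vector similarity.
    weights = weights or {}
    model_vec = aggregated_scores(relative_frequencies(model_groups), weights)
    data_vec = aggregated_scores(relative_frequencies(dataset_groups), weights)
    return cosine_similarity(model_vec, data_vec)


if __name__ == "__main__":
    # Hypothetical model and datasets, each split into source groups.
    model = {"code": "predict customer churn",
             "comments": "churn model for telecom customer"}
    churn_data = {"columns": "customer churn tenure",
                  "description": "telecom customer records"}
    weather_data = {"columns": "temperature humidity",
                    "description": "weather station readings"}
    weights = {"churn": 2.0}  # hypothetical keyword weight
    print(relevance(model, churn_data, weights))
    print(relevance(model, weather_data, weights))
```

In this sketch a dataset that shares keywords with the model scores above zero, while one with no keywords in common scores exactly zero. Any other vector similarity measure could be substituted for `cosine_similarity` without changing the earlier steps.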

