Patent attributes
A computer processor generates a topic-based dataset based on parsing content received from a plurality of information sources, which includes historical data and scientific data, associated with a location of a natural resource. The processor generates a plurality of clusters, respectively corresponding to like-topic data of the topic-based dataset. The processor determines a plurality of hypotheses, respectively corresponding to the plurality of clusters of the like-topic data, wherein the plurality of hypotheses are based on features associated with each of the plurality of clusters of the like-topic data. The processor combines pairs of clusters, based on a similarity heuristic applied to the one or more pairs of clusters, and the processor determines a plurality of probabilities respectively corresponding to a validity of each hypothesis of the plurality of hypotheses, associated with the location of a natural resource.