Patent attributes
An apparatus in one embodiment comprises a distributed data processing system in which multiple processing devices communicate with one another over at least one network. The distributed data processing system is configured to obtain reads of biological samples of respective sample sources, with each of the biological samples containing genomic material from a plurality of distinct microorganisms within an environment of a corresponding one of the sample sources, and to perform distributed data analytics to provide surveillance functionality characterizing at least one of a disease, an infection and a contamination as involving genomic material from multiple ones of the sample source. Performing distributed data analytics illustratively comprises performing local analytics in respective ones of a plurality of data zones, and performing global analytics utilizing results of the local analytics performed in the respective data zones.