Patent attributes
An apparatus in one embodiment comprises at least one processing device having a processor coupled to a memory. The processing device implements a first workload distribution node configured to communicate with multiple distributed data processing clusters over at least one network. The workload distribution node is further configured to receive a data processing request, to identify particular ones of the distributed data processing clusters that are suitable for handling at least a portion of the data processing request, and to assign the data tasks to one or more of the distributed data processing clusters. Results of performance of the data tasks from the one or more assigned distributed data processing clusters are received by the first workload distribution node and aggregated into a response that is returned to a source of the data processing request. The source of the data processing request in some embodiments is another workload distribution node.