Patent attributes
Various systems and methods for implementing distribution of a neural network workload are described herein. A discovery message is encoded that includes a latency requirement and requested resources for a workload of a neural network. A discovery response, received from a proximate resource in response to the discovery message, is decoded; it includes the resources that the proximate resource has available for the workload, based on the requested resources. The proximate resource is selected to execute the workload based on those available resources. In response to the discovery response, an offload request is encoded that includes a description of the workload. The description of the workload identifies the node to execute at the proximate resource. A result received in response to the offload request is used to provide an input to an advanced driver-assistance system (ADAS).
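
The message exchange described above can be illustrated with a minimal Python sketch. It is not the claimed implementation: every class, field, and function name here (`DiscoveryMessage`, `DiscoveryResponse`, `OffloadRequest`, `select_resource`, `build_offload_request`), the resource keys, and the "first responder that covers every requested resource" selection rule are assumptions made for illustration, since the abstract does not specify message layouts, units, or a selection policy.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class DiscoveryMessage:
    """Hypothetical discovery message: latency requirement plus requested resources."""
    latency_ms: float            # latency requirement for the workload (unit assumed)
    requested: Dict[str, float]  # e.g. {"memory_mb": 512, "compute_units": 2}


@dataclass
class DiscoveryResponse:
    """Hypothetical reply from a proximate resource listing what it can offer."""
    resource_id: str
    available: Dict[str, float]


@dataclass
class OffloadRequest:
    """Hypothetical offload request; the workload description names the nodes to run."""
    resource_id: str
    node_ids: List[str]


def covers(msg: DiscoveryMessage, resp: DiscoveryResponse) -> bool:
    """True if the responder offers at least the requested amount of every resource."""
    return all(resp.available.get(name, 0.0) >= amount
               for name, amount in msg.requested.items())


def select_resource(msg: DiscoveryMessage,
                    responses: List[DiscoveryResponse]) -> Optional[DiscoveryResponse]:
    """Assumed policy: pick the first responder whose available resources suffice."""
    for resp in responses:
        if covers(msg, resp):
            return resp
    return None


def build_offload_request(resp: DiscoveryResponse,
                          node_ids: List[str]) -> OffloadRequest:
    """Build the offload request addressed to the selected proximate resource."""
    return OffloadRequest(resource_id=resp.resource_id, node_ids=list(node_ids))
```

As a usage example under the same assumptions, a requester that needs 512 MB of memory and 2 compute units would skip a responder offering only 256 MB and select one offering 1024 MB, then address the offload request to it. In this sketch the latency requirement is only carried in the discovery message; how a proximate resource weighs it before responding is left open, as in the abstract.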