an open-source infrastructure paradigm for building parallel reinforcement learning algorithms in TensorFlow, allowing new algorithms to be developed and trained efficiently
an open-source infrastructure paradigm for building parallel reinforcement learning algorithms in TensorFlow, allowing new algorithms to be developed and trained efficiently
TensorFlow Agents (TF-Agents) is an open-source, efficient infrastructure paradigm for building parallel reinforcement learning algorithms in TensorFlow.
TF-Agents simulates multiple environments in parallel, and groups them to perform the neural network computation on a batch rather than individual observations. This allows the TensorFlow execution engine to parallelize computation, without the need for manual synchronization. Environments are stepped in separate Python processes to progress them in parallel without interference of the global interpreter lock.
An "agent" is a core element of reinforcement learning which encompasses two main responsibilities:
Currently the following algorithms are available under TF-Agents:
In their paper, TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow, authors Hafter, Davidson, and Vanhoucke also introduced BatchPPO, which is an efficient implementation of the proximal policy optimization algorithm.
an open-source infrastructure paradigm for building parallel reinforcement learning algorithms in TensorFlow, allowing new algorithms to be developed and trained efficiently