Patent 11132211 was granted and assigned to Apple (company) on September, 2021 by the United States Patent and Trademark Office.
A system includes a state-dependent action policy and a state-dependent transition policy. The state-dependent action policy determines an action based on environment states and a current agent state selected from a predetermined group of agent states. The state-dependent transition policy is implemented using one or more machine learning models and is configured to control transitions between agent states from the predetermined group of agent states.