Is a
Patent attributes
Patent Applicant
Current Assignee
Patent Jurisdiction
Patent Number
Date of Patent
February 14, 2023
Patent Application Number
16653890
Date Filed
October 15, 2019
Patent Citations
Patent Primary Examiner
CPC Code
Systems and methods are provided for efficient off-policy credit assignment (ECA) in reinforcement learning. ECA allows principled credit assignment for off-policy samples, and therefore improves sample efficiency and asymptotic performance. One aspect of ECA is to formulate the optimization of expected return as approximate inference, where policy is approximating a learned prior distribution, which leads to a principled way of utilizing off-policy samples. Other features are also provided.
Timeline
No Timeline data yet.
Further Resources
No Further Resources data yet.