Is a
Patent attributes
Patent Applicant
Current Assignee
Patent Jurisdiction
Patent Number
Patent Inventor Names
Koray Kavukcuoglu
Volodymyr Mnih
Date of Patent
June 29, 2021
Patent Application Number
15619393
Date Filed
June 9, 2017
Patent Citations Received
Patent Primary Examiner
Patent abstract
We describe a method of reinforcement learning for a subject system having multiple states and actions to move from one state to the next. Training data is generated by operating on the system with a succession of actions and used to train a second neural network. Target values for training the second neural network are derived from a first neural network which is generated by copying weights of the second neural network at intervals.
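The copying scheme in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: a lookup table stands in for each neural network's weights, and the chain environment, the function names (`make_q`, `copy_weights`, `step`, `train`) and all hyperparameters are assumptions chosen for illustration. The sketch only shows the relationship the abstract describes: the second network is the one being trained, while target values come from a first network that is refreshed by copying the second network's weights at intervals.

```python
import random

def make_q(n_states, n_actions):
    # Assumption: a table of action values stands in for a network's weights.
    return [[0.0] * n_actions for _ in range(n_states)]

def copy_weights(src):
    # Independent copy, so later training of src does not alter the copy.
    return [row[:] for row in src]

def step(state, action, n_states):
    # Hypothetical chain environment: action 1 moves right, action 0 moves left.
    # Reaching the last state ends the episode with reward 1.
    nxt = min(state + 1, n_states - 1) if action == 1 else max(state - 1, 0)
    done = nxt == n_states - 1
    return nxt, (1.0 if done else 0.0), done

def train(n_states=5, n_actions=2, episodes=300, gamma=0.9, lr=0.5,
          epsilon=0.2, copy_interval=20, seed=0):
    rng = random.Random(seed)
    online = make_q(n_states, n_actions)  # the "second" network: trained
    target = copy_weights(online)         # the "first" network: periodic copy
    updates = 0
    for _ in range(episodes):
        state = 0
        for _ in range(50):  # cap on episode length
            # Epsilon-greedy action selection with random tie-breaking.
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)
            else:
                best = max(online[state])
                action = rng.choice(
                    [a for a in range(n_actions) if online[state][a] == best])
            nxt, reward, done = step(state, action, n_states)
            # The target value is computed from the copied network,
            # not from the network currently being trained.
            y = reward if done else reward + gamma * max(target[nxt])
            online[state][action] += lr * (y - online[state][action])
            updates += 1
            # At intervals, refresh the copy from the trained network.
            if updates % copy_interval == 0:
                target = copy_weights(online)
            state = nxt
            if done:
                break
    return online, target

if __name__ == "__main__":
    online, target = train()
    for s, row in enumerate(online):
        print(s, [round(v, 3) for v in row])
```

Holding the target-producing copy fixed between refreshes means the values the trained network is regressed toward do not shift on every update. The interval (`copy_interval` here) is a free parameter of the sketch; the abstract does not specify it.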