US Patent 9165249 Information processing apparatus, information processing method, and program

Is a

Patent

Patent attributes

Current Assignee

Sony

Patent Jurisdiction

United States Patent and Trademark Office

Patent Number

9165249

Date of Patent

October 20, 2015

Patent Application Number

13611535

Date Filed

September 12, 2012

Patent Citations Received

US Patent 11928556 Removing unnecessary history from reinforcement learning state

Patent Primary Examiner

Jeffrey A. Gaffin

Patent abstract

Provided is an information processing apparatus including: a reward estimator generating unit using action history data, which includes state data expressing a state of an agent, action data expressing an agent's action, and a reward value expressing a reward of the action, as learning data to generate, through machine learning, a reward estimator estimating the reward value from inputted state data and action data; an action selecting unit preferentially selecting an action not included in the action history data but with a high estimated reward value; and an action history adding unit causing the agent to perform the selected action and adding to the action history data the state data and action data for the action and the action's reward value in association with each other. The reward estimator is regenerated when a set of state data, action data, and the reward value is added to the action history data.
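The loop described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the abstract does not say which machine learning method builds the reward estimator, so a simple averaging estimator stands in for it here. The names `fit_reward_estimator`, `select_action`, `run`, `toy_step`, and the `novelty_bonus` parameter are all hypothetical, as is the toy environment with one state and three actions.

```python
from collections import defaultdict


def fit_reward_estimator(history):
    """Build a reward estimator from (state, action, reward) records.

    Stand-in learner (an assumption, not the patent's method): the mean
    reward per (state, action), falling back to the mean per action,
    then to the overall mean.
    """
    by_pair = defaultdict(list)
    by_action = defaultdict(list)
    for state, action, reward in history:
        by_pair[(state, action)].append(reward)
        by_action[action].append(reward)
    overall = sum(r for _, _, r in history) / len(history) if history else 0.0

    def estimate(state, action):
        if (state, action) in by_pair:
            rewards = by_pair[(state, action)]
            return sum(rewards) / len(rewards)
        if action in by_action:
            rewards = by_action[action]
            return sum(rewards) / len(rewards)
        return overall

    return estimate


def select_action(state, actions, history, estimate, novelty_bonus=0.5):
    """Prefer actions absent from the history that have a high estimate."""
    tried = {(s, a) for s, a, _ in history}

    def score(action):
        bonus = 0.0 if (state, action) in tried else novelty_bonus
        return estimate(state, action) + bonus

    return max(actions, key=score)


def run(env_step, initial_state, actions, steps):
    """Select an action, perform it, record the result, refit the estimator."""
    history = []
    estimate = fit_reward_estimator(history)
    state = initial_state
    for _ in range(steps):
        action = select_action(state, actions, history, estimate)
        next_state, reward = env_step(state, action)
        # Store state, action, and reward in association with each other.
        history.append((state, action, reward))
        # Regenerate the estimator whenever a record is added.
        estimate = fit_reward_estimator(history)
        state = next_state
    return history, estimate


def toy_step(state, action):
    """Hypothetical environment: a single state with fixed rewards."""
    rewards = {"a": 0.25, "b": 1.0, "c": 0.5}
    return state, rewards[action]


history, estimate = run(toy_step, 0, ["a", "b", "c"], steps=6)
print([action for _, action, _ in history])  # → ['a', 'b', 'c', 'b', 'b', 'b']
```

In this sketch the novelty bonus makes the agent try each untried action once, after which it settles on the action with the highest estimated reward. Refitting on every added record mirrors the last sentence of the abstract; a real system would likely use a more capable learner in place of the averaging estimator.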

