Patent attributes
A computer-implemented method is provided for evaluating a next action of a target object in an environment. The method includes simulating, by a hardware processor, for each possible action of the target object in the environment, the next state that follows that action, to obtain a plurality of simulated next states. The simulation is based on a pessimistic scenario that is randomly generated by sampling a plurality of costs from a distribution of cost. The distribution of cost corresponds to an area that the target object is likely to visit in the near future. The method further includes identifying, by the hardware processor, a safety area for the target object in each of the plurality of simulated next states. The method also includes evaluating, by the hardware processor, each of the possible actions of the target object based on the safety area identified for the corresponding simulated next state.
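The steps recited above (sample costs, simulate each action, identify a safety area, evaluate) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the specification: the environment is a small grid, the distribution of cost is represented as a per-cell probability `cost_prob`, the pessimistic scenario is taken to be the union of several random draws, the safety area is the number of cells reachable within `horizon` steps without entering a costly cell, and the names `ACTIONS`, `sample_pessimistic_costs`, `safety_area`, and `evaluate_actions` are hypothetical.

```python
import random
from collections import deque

# Hypothetical action set: name -> (row delta, column delta).
ACTIONS = {
    "stay": (0, 0),
    "up": (-1, 0),
    "down": (1, 0),
    "left": (0, -1),
    "right": (0, 1),
}


def sample_pessimistic_costs(cost_prob, rng, n_samples):
    """Draw n_samples random scenarios and keep every cell any draw marks as costly.

    Taking the union of the draws is one assumed way to make the scenario pessimistic.
    """
    costly = set()
    for _ in range(n_samples):
        for r, row in enumerate(cost_prob):
            for c, p in enumerate(row):
                if rng.random() < p:
                    costly.add((r, c))
    return costly


def safety_area(start, costly, rows, cols, horizon):
    """Count cells reachable from start within horizon steps while avoiding costly cells."""
    if start in costly:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        (r, c), depth = queue.popleft()
        if depth == horizon:
            continue  # do not expand beyond the look-ahead horizon
        for dr, dc in ACTIONS.values():
            nxt = (r + dr, c + dc)
            inside = 0 <= nxt[0] < rows and 0 <= nxt[1] < cols
            if inside and nxt not in costly and nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, depth + 1))
    return len(seen)


def evaluate_actions(position, cost_prob, n_samples=3, horizon=3, seed=0):
    """Score each in-grid action by the safety area of its simulated next state."""
    rng = random.Random(seed)
    rows, cols = len(cost_prob), len(cost_prob[0])
    costly = sample_pessimistic_costs(cost_prob, rng, n_samples)
    scores = {}
    for name, (dr, dc) in ACTIONS.items():
        nxt = (position[0] + dr, position[1] + dc)  # simulated next state
        if not (0 <= nxt[0] < rows and 0 <= nxt[1] < cols):
            continue  # this action would leave the grid, so it is not scored
        scores[name] = safety_area(nxt, costly, rows, cols, horizon)
    return scores
```

Under these assumptions, a caller could select the action with the largest score, for example `max(scores, key=scores.get)`. Other readings of "pessimistic scenario" and "safety area" are equally possible, and the sketch does not model other objects or time-varying costs.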