Patent attributes
A computer-implemented method for training an agent in a first context including an entity and an environment of the entity, to allow an apparatus to perform a navigation task in a second context comprising the apparatus and a physical environment of the apparatus, the apparatus adapted to receive images of the physical environment of the apparatus and comprising a steering device adapted to control the direction of the apparatus, the method comprising: obtaining one or more navigation tasks comprising: generating a navigation task; scoring the navigation task using a machine-learned model trained to estimate the easiness of tasks; in response to the score satisfying a selection criterion, selecting the navigation task as one of the one or more navigation tasks; and training the agent using a reinforcement learning method comprising attempting to perform, by the entity, the one or more navigation tasks using images of the environment of the entity.