Patent attributes
A machine learning device learns an action of a driving source in a transport device that continuously transports at least two transported objects along a transport path. The device includes a hardware processor that: acquires position information of the at least two transported objects on the transport path based on a detection result from a sensor provided in the transport path; calculates a reward from the acquired position information according to a predetermined rule; learns the action by calculating an action value in reinforcement learning based on the acquired position information and the calculated reward; and generates and outputs control information that causes the driving source to perform an action determined from the learning result.
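The acquire, reward, learn, and control steps described above can be illustrated with a minimal sketch. It is not the patented implementation: it assumes tabular Q-learning as the action-value method, a one-dimensional transport path divided into cells, a state reduced to the gap between two objects, and a reward rule that favors a fixed target gap. All names (`gap_state`, `reward`, `step`, `train`, `control`) and all constants are hypothetical.

```python
import random
from collections import defaultdict

# Hypothetical actions of the driving source acting on the trailing object:
# -1 = slow down, 0 = hold speed, +1 = speed up.
ACTIONS = (-1, 0, 1)
TARGET_GAP = 5   # assumed desired spacing between the two objects, in cells
MAX_GAP = 10     # assumed largest gap the toy model allows


def gap_state(pos_lead, pos_trail):
    """Reduce the two sensed positions to a discrete state: the clamped gap."""
    return max(0, min(MAX_GAP, pos_lead - pos_trail))


def reward(gap):
    """Assumed predetermined rule: +1 at the target gap, else a penalty
    proportional to the distance from the target."""
    return 1.0 if gap == TARGET_GAP else -abs(gap - TARGET_GAP) / MAX_GAP


def step(pos_lead, pos_trail, action):
    """Toy transport model: the lead object advances 1 cell per tick and the
    trailing object advances 1 + action cells. The trailing object can neither
    overtake the lead nor fall more than MAX_GAP cells behind."""
    pos_lead += 1
    pos_trail += 1 + action
    pos_trail = min(pos_trail, pos_lead)
    pos_trail = max(pos_trail, pos_lead - MAX_GAP)
    return pos_lead, pos_trail


def train(episodes=5000, steps=20, alpha=0.2, gamma=0.9, epsilon=0.2, seed=0):
    """Learn action values Q[(state, action)] with epsilon-greedy Q-learning."""
    rng = random.Random(seed)
    q = defaultdict(float)
    for _ in range(episodes):
        lead, trail = rng.randint(0, MAX_GAP), 0
        s = gap_state(lead, trail)
        for _ in range(steps):
            # Explore with probability epsilon, otherwise act greedily.
            if rng.random() < epsilon:
                a = rng.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda x: q[(s, x)])
            lead, trail = step(lead, trail, a)
            s_next = gap_state(lead, trail)
            r = reward(s_next)
            # Q-learning update toward the bootstrapped target.
            best_next = max(q[(s_next, x)] for x in ACTIONS)
            q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])
            s = s_next
    return q


def control(q, pos_lead, pos_trail):
    """Generate control information: the action with the highest learned
    value for the currently sensed positions."""
    s = gap_state(pos_lead, pos_trail)
    return max(ACTIONS, key=lambda x: q[(s, x)])
```

Calling `q = train()` and then `control(q, pos_lead, pos_trail)` returns the action the learned values prefer for the sensed positions. In this toy setup the learned policy should speed the trailing object up when the gap is wider than the target and slow it down when the gap is narrower. A real device would replace `step` with actual sensor readings and would likely need a richer state than a single gap.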