Patent attributes
One or more systems, devices, computer program products and/or computer-implemented methods of use provided herein relate to outputting an optimal decision policy base on informal knowledge input. A system can comprise a memory that stores computer executable components, and a processor that executes the computer executable components stored in the memory, wherein the computer executable components can comprise an analysis component that analyzes an input dataset comprising a constraint in a natural language form, and an augmentation component that generates an influence mapping comprising a constraint variable based on the constraint input. In an embodiment, an input dataset employed to support the influence mapping can comprise time-stamped tuple data comprising a state, an action and a reward. In an embodiment, an inference engine can generate an output policy in response to the constraint input and which output policy can be based on the constraint input and constraint variable.