policy: Calculates the reinforcement learning policy
Description
Calculates the reinforcement learning policy from a given state-action table Q.
The policy is the decision-making function of the agent and defines the learning
agent's behavior at a given time.
Usage
policy(x)
Arguments
x
Variable that encodes the behavior of the agent. This can be
a matrix, a data.frame, or an rl object.
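Examples
The following is a minimal base-R sketch of how a policy can be read off a state-action table, not necessarily how policy() is implemented internally. It assumes a greedy rule (for each state, pick the action with the largest Q-value) and a table with states as rows and actions as columns. The helper name greedy_policy and the toy values in Q are hypothetical and used for illustration only.

```r
# Hypothetical helper (not part of the package): for each state (row),
# return the name of the action (column) with the largest Q-value.
# Assumes a greedy rule; ties are resolved in favor of the first column.
greedy_policy <- function(Q) {
  Q <- as.matrix(Q)                  # accept a matrix or a data.frame
  best <- apply(Q, 1, which.max)     # column index of each row's maximum
  setNames(colnames(Q)[best], rownames(Q))
}

# Toy state-action table with made-up values:
# rows are states, columns are actions.
Q <- matrix(c(0.1, 0.9,
              0.7, 0.2,
              0.4, 0.5),
            nrow = 3, byrow = TRUE,
            dimnames = list(c("s1", "s2", "s3"), c("left", "right")))

pi_hat <- greedy_policy(Q)
print(pi_hat)
```

The result is a named character vector with one entry per state, so pi_hat["s2"] looks up the action chosen in state s2.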