sampleGridSequence

(optional) Defines the action selection mode of the reinforcement learning agent. Default: <code>random</code>.

actionSelection

(optional) Control parameters defining the behavior of the agent.
Default: <code>alpha = 0.1</code>; <code>gamma = 0.1</code>; <code>epsilon = 0.1</code>.

control

(optional) Existing model of class <code>rl</code>. Default: <code>NULL</code>.

model

Additional parameters passed to function.

Function uses an environment function to generate sample experience in the form of state transition tuples.

Performs model-free reinforcement learning in R. This implementation enables the learning
of an optimal policy based on sample sequences consisting of states, actions and rewards. In
addition, it supplies multiple predefined reinforcement learning algorithms, such as experience
replay. Methodological details can be found in Sutton and Barto (1998) <ISBN:0262039249>.

sampleGridSequence: Sample grid sequence

Description

Usage

Arguments

Value

See Also