selectEpsilonGreedyAction: Performs \(\varepsilon\)-greedy action selection

Q

<p>State-action table of type <code>hash</code>.</p>

state

epsilon

<p>Implements \(\varepsilon\)-greedy action selection. With probability \(\varepsilon\), the agent explores
the environment by selecting an action at random. Otherwise, with probability \(1-\varepsilon\), it exploits
its current knowledge by choosing the action with the highest estimated value in the given state.</p>
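The selection rule described above can be sketched in a few lines of base R. This is only an illustration, not the package's implementation: the helper name `epsilonGreedySketch` is hypothetical, and a plain matrix (states as rows, actions as columns) is assumed in place of the `hash`-based state-action table.

```r
# Hypothetical stand-in for the hash-based state-action table:
# a matrix with one row per state and one column per action.
Q <- matrix(c(0.1, 0.7, 0.2,
              0.5, 0.3, 0.9),
            nrow = 2, byrow = TRUE,
            dimnames = list(c("s1", "s2"), c("left", "right", "up")))

# Illustrative helper (assumed name), not the package function.
epsilonGreedySketch <- function(Q, state, epsilon) {
  actions <- colnames(Q)
  if (runif(1) < epsilon) {
    # Explore: draw one action uniformly at random.
    sample(actions, 1)
  } else {
    # Exploit: take the action with the highest estimated value.
    actions[which.max(Q[state, ])]
  }
}

set.seed(42)
epsilonGreedySketch(Q, "s1", epsilon = 0)    # purely greedy: "right"
epsilonGreedySketch(Q, "s2", epsilon = 0.1)  # greedy in about 90% of calls
```

Setting `epsilon = 0` makes the choice deterministic, while `epsilon = 1` makes it uniformly random. Ties between equally valued actions are resolved here by `which.max`, which returns the first maximum; other tie-breaking rules are equally valid.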

Performs model-free reinforcement learning in R. This implementation enables the learning
of an optimal policy based on sample sequences consisting of states, actions and rewards. In
addition, it supplies multiple predefined reinforcement learning algorithms, such as experience
replay. Methodological details can be found in Sutton and Barto (1998) <ISBN:0262039249>.
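
As a rough illustration of learning from sample sequences, the sketch below replays stored (state, action, reward, next state) tuples through a tabular Q-learning update in base R. All names here (`replaySketch`, the column names, `alpha`, `gamma`) are assumptions chosen for the example, and the matrix-based table stands in for the package's own data structures; it is not the package's implementation.

```r
# Hypothetical sample tuples: state, action, reward, next state.
samples <- data.frame(
  State     = c("s1", "s2", "s1"),
  Action    = c("right", "up", "right"),
  Reward    = c(0, 1, 0),
  NextState = c("s2", "s1", "s2"),
  stringsAsFactors = FALSE
)

states  <- c("s1", "s2")
actions <- c("left", "right", "up")

# State-action values, initialised to zero.
Q <- matrix(0, nrow = length(states), ncol = length(actions),
            dimnames = list(states, actions))

alpha <- 0.1  # learning rate (assumed value)
gamma <- 0.9  # discount factor (assumed value)

# Illustrative helper (assumed name): one pass over the stored samples.
replaySketch <- function(Q, samples, alpha, gamma) {
  for (i in seq_len(nrow(samples))) {
    s  <- samples$State[i]
    a  <- samples$Action[i]
    r  <- samples$Reward[i]
    s2 <- samples$NextState[i]
    # Q-learning target: reward plus discounted best value of the next state.
    target <- r + gamma * max(Q[s2, ])
    Q[s, a] <- Q[s, a] + alpha * (target - Q[s, a])
  }
  Q
}

Q <- replaySketch(Q, samples, alpha, gamma)
Q
```

Calling `replaySketch` repeatedly on the same samples propagates the reward further back through the table, which is the basic idea behind reusing past experience.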

Nicolas Proellochs
