RSTD

Learning Rate: $\alpha$ 
$$Q_{new} = Q_{old} + \alpha_{-} \cdot (R - Q_{old}), \quad R < Q_{old}$$
$$Q_{new} = Q_{old} + \alpha_{+} \cdot (R - Q_{old}), \quad R \ge Q_{old}$$
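
The two-branch update above can be sketched in a few lines of base R. This is a minimal illustration under stated assumptions, not the package's internal code: the function name `rstd_update` and its argument names are hypothetical and do not belong to the multiRL API.

```r
# Hypothetical helper (not part of multiRL): one risk-sensitive TD update.
rstd_update <- function(q_old, reward, alpha_neg, alpha_pos) {
  # Choose the learning rate from the sign of the prediction error (R - Q_old):
  # alpha_neg when the reward falls below the current value, alpha_pos otherwise.
  alpha <- if (reward < q_old) alpha_neg else alpha_pos
  q_old + alpha * (reward - q_old)
}

# Prediction errors of equal size but opposite sign move the value
# by different amounts when the two learning rates differ.
rstd_update(q_old = 0.5, reward = 0, alpha_neg = 0.4, alpha_pos = 0.1)  # 0.3
rstd_update(q_old = 0.5, reward = 1, alpha_neg = 0.4, alpha_pos = 0.1)  # 0.55
```

With $\alpha_{-} > \alpha_{+}$, as in this example, disappointing outcomes are weighted more heavily than pleasant surprises, which tends to pull the learned value of a risky option below its mean payoff.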
Inverse Temperature: $\beta$ 
$$
 P_{t}(a) = 
 \frac{
 \exp(\beta \cdot Q_{t}(a))
 }{
 \sum_{i=1}^{k} \exp(\beta \cdot Q_{t}(a_{i}))
 }
 $$
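
The softmax rule above can likewise be sketched in base R. The function name `softmax_choice_prob` is hypothetical (it is not a multiRL function), and the subtraction of the maximum is a standard numerical safeguard added here rather than something the formula states.

```r
# Hypothetical helper (not part of multiRL): softmax choice probabilities.
softmax_choice_prob <- function(q, beta) {
  z <- beta * q
  # Subtracting max(z) leaves the probabilities unchanged but prevents
  # exp() from overflowing when beta * Q is large.
  z <- z - max(z)
  exp(z) / sum(exp(z))
}

q <- c(0.2, 0.5, 0.8)
softmax_choice_prob(q, beta = 0)   # uniform: every action has probability 1/3
softmax_choice_prob(q, beta = 20)  # nearly all mass on the highest-valued action
```

A small $\beta$ yields near-random choices, while a large $\beta$ makes the agent choose the highest-valued action almost deterministically.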

A flexible general-purpose toolbox for implementing Rescorla-Wagner models
in multi-armed bandit tasks.
As the successor and functional extension of the 'binaryRL' package,
'multiRL' modularizes the Markov Decision Process (MDP) into six core
components. This framework enables users to construct custom models via
intuitive if-else syntax and define latent learning rules for agents.
For parameter estimation, it provides both likelihood-based
inference (MLE and MAP) and simulation-based inference (ABC and
RNN), with full support for parallel processing across subjects.
The workflow is highly standardized, featuring four main functions
that strictly follow the four-step protocol (and ten rules)
proposed by Wilson & Collins (2019) <doi:10.7554/eLife.49547>.
Beyond the three built-in models (TD, RSTD, and Utility), users
can easily derive new variants by declaring which variables are
treated as free parameters.

YuKi 

multiRL

Reinforcement Learning Tools for Multi-Armed Bandit

Xinyu 

RSTD function

<dl><dt>params</dt>
<dd>Parameters used by the model’s internal functions,
see params</dd></dl>

Arguments

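<pre><code>RSTD &lt;- function(params){

  # Name the three free parameters by position: alphaN, alphaP, beta.
  params &lt;- list(
    free = list(alphaN = params[1], alphaP = params[2], beta = params[3])
  )

  # data, behrule, colnames, funcs, priors and settings are not arguments of
  # RSTD; R resolves them from the function's enclosing environment.
  multiRL.model &lt;- multiRL::run_m(
    data = data,
    behrule = behrule,
    colnames = colnames,
    params = params,
    funcs = funcs,
    priors = priors,
    settings = settings
  )

  # Store the model object in multiRL.env, then return the result
  # extracted from it by .return_result().
  assign(x = "multiRL.model", value = multiRL.model, envir = multiRL.env)
  return(.return_result(multiRL.model))
}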
</code></pre>

Body

Risk Sensitive Model — RSTD

