Learn R Programming

multiRL (version 0.4.5)

Reinforcement Learning Tools for Multi-Armed Bandit

Description

A flexible general-purpose toolbox for implementing Rescorla-Wagner models in multi-armed bandit tasks. As the successor and functional extension of the 'binaryRL' package, 'multiRL' modularizes the Markov Decision Process (MDP) into six core components. This framework enables users to construct custom models via intuitive if-else syntax and define latent learning rules for agents. For parameter estimation, it provides both likelihood-based inference (MLE and MAP) and simulation-based inference (ABC and RNN), with full support for parallel processing across subjects. The workflow is highly standardized, featuring four main functions that strictly follow the four-step protocol (and ten rules) proposed by Wilson & Collins (2019) . Beyond the three built-in models (TD, RSTD, and Utility), users can easily derive new variants by declaring which variables are treated as free parameters.

Copy Link

Version

Install

install.packages('multiRL')

Monthly Downloads

149

Version

0.4.5

License

GPL-3

Maintainer

YuKi

Last Published

June 9th, 2026

Functions in multiRL (0.4.5)

The Engine of Recurrent Neural Network (RNN)

Estimation Method: Maximum Likelihood Estimation (MLE)

Estimation Method: Recurrent Neural Network (RNN)

Function: Decay Rate

Simulated-Based Inference (SBI)

Function: Utility

Function: Learning Rate

Function: Exploration or Exploitation

Function: Probability

estimation_methods

Estimate Methods

Step 3: Optimizing parameters to fit real data

process_4_output_cpp

process_1_input

Density and Random Function

process_2_behrule

multiRL.behrule

process_3_record

plot.multiRL.replay

plot.multiRL.replay

Layers and Loss Functions (RNN)

Policy of Agent

multiRL-package

multiRL: Reinforcement Learning Tools for Multi-Armed Bandit

Model Parameters

Step 1: Building reinforcement learning model

Step 4: Replaying the experiment with optimal parameters

Dimension Reduction Methods (ABC)

process_5_metric

process_4_output_r

summary,multiRL.model-method

Step 2: Generating fake data for parameter and model recovery

Cognitive Processing System

Settings of Model

Group 2 from Mason et al. (2024)

Data from Collins and Frank (2012)

Temporal Differences Model

Simulated Multi-Arm Bandit Dataset

Controls of Estimation Methods

Algorithm Packages (MLE, MAP)

Risk Sensitive Model

The Engine of Approximate Bayesian Computation (ABC)

Estimation Method: Maximum A Posteriori (MAP)

Dataset Structure

Estimation Method: Approximate Bayesian Computation (ABC)

Likelihood-Based Inference (LBI)

Estimate Methods

Tool for Generating an Environment for Models