Learn R Programming

multiRL (version 0.2.3)

data: Dataset Structure

Description

Experimental data from any Multi-Armed Bandit (MAB)-like task.

Arguments

Class

data [data.frame]

subidblocktrialobject_1object_2object_3object_4reward_1reward_2reward_3reward_4action
111ABCD2006040A
112ABCD20406080B
113ABCD2006040C
114ABCD20406080D
........................

Details

Each row must contain all information relevant to that trial for running a decision-making task (e.g., multi-armed bandit) as well as the feedback received.

In this type of paradigm, the rewards associated with possible actions must be explicitly written in the table for every trial (aka, tabular case, see Sutton & Barto, 2018, Chapter 2).

References

Sutton, R. S., & Barto, A. G. (2018). Reinforcement Learning: An Introduction (2nd ed). MIT press.