Deprecated. Please use [ReinforcementLearning::replayExperience()] instead.
experienceReplay(D, Q, control, ...)A dataframe containing the input data for reinforcement learning.
Each row represents a state transition tuple (s,a,r,s_new).
Existing state-action table of type hash.
Control parameters defining the behavior of the agent.
Additional parameters passed to function.
Returns an object of class hash that contains the learned Q-table.
Lin (1992). "Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching", Machine Learning (8:3), pp. 293--321.
Watkins (1992). "Q-learning". Machine Learning (8:3), pp. 279--292.