pomdp (version 1.0.0)

reward: Calculate the Reward for a POMDP Solution

Description

This function calculates the expected total reward for a POMDP solution given a starting belief state.

Usage

reward(x, belief = NULL, epoch = 1)

Arguments

x

a solved POMDP object.

belief

specification of the current belief state (see the argument start in POMDP for details). By default, the belief state defined in the model as start is used.

epoch

the epoch for which the reward is returned. Defaults to the first epoch.

Value

A list with the components

reward

the total expected reward given a belief and epoch.

belief_state

the belief state specified in belief.

pg_node

the policy graph node that represents the belief state.

action

the optimal action.
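
For illustration, these components can be accessed like elements of any R list. This is a minimal sketch; sol is a solved Tiger model as constructed in the Examples below:

r <- reward(sol)
r$reward   # total expected reward at the given belief
r$pg_node  # policy graph node representing that belief
r$action   # optimal action at that belief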

Details

The value is calculated using the value function stored in the POMDP solution. The value function is represented by a set of alpha vectors; the reward for a belief state is the maximum over these alpha vectors of the inner product between the belief vector and the alpha vector, and the maximizing vector determines the policy graph node and the optimal action.
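
The following sketch illustrates this computation outside the package. The matrix alpha (one row per policy graph node, one column per state) is a hypothetical stand-in for the alpha vectors stored in the solution, not the package's internal representation:

# value of a belief under a set of alpha vectors:
# the best alpha vector gives the reward and the policy graph node
value_at_belief <- function(alpha, belief) {
  vals <- alpha %*% belief   # value of each alpha vector at this belief
  node <- which.max(vals)    # index of the maximizing alpha vector
  list(reward = vals[node], pg_node = node)
}

# two states, two alpha vectors
alpha <- rbind(c(10, -5), c(-5, 10))
value_at_belief(alpha, c(0.85, 0.15))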

See Also

Other policy: optimal_action(), plot_policy_graph(), plot_value_function(), policy_graph(), policy(), solve_POMDP(), solve_SARSOP()

Examples

data("Tiger")
sol <- solve_POMDP(model = Tiger)

# if no belief is specified, the start belief from the model is used
# (uniform for the Tiger problem).
reward(sol)

# we have additional information that makes us believe that the tiger
# is more likely to the left.
reward(sol, belief = c(0.85, 0.15))

# specifying a state name places all belief mass on that state:
# we start knowing that the tiger is to the left.
reward(sol, belief = "tiger-left")

# Note that in this case, the total discounted expected reward is greater
# than 10 since the tiger problem resets and another game starting with
# a uniform belief is played, which produces additional reward.