transition_matrix

transition_prob

observation_matrix

observation_prob

reward_matrix

reward_val

A <a rd-options="" href="/link/POMDP?package=pomdp&version=1.0.2" data-mini-rdoc="pomdp::POMDP">POMDP</a> object.

Episode used for time-dependent POMDPs (<a rd-options="" href="/link/POMDP?package=pomdp&version=1.0.2" data-mini-rdoc="pomdp::POMDP">POMDP</a>).

episode

only return the matrix/value for a given action.

action

start.state, end.state, observation

Converts the description of transition probabilities and observation
probabilities in a POMDP into a list of matrices. Individual values or parts of the matrices
can be more efficiently retrieved using the functions ending <code>_prob</code> and <code>_val</code>.

Provides the infrastructure to define and analyze the solutions of Partially Observable Markov Decision Process (POMDP) models. Interfaces for various exact and approximate solution algorithms are available including value iteration, point-based value iteration and SARSOP. Smallwood and Sondik (1973) <doi:10.1287/opre.21.5.1071>.

Michael Hahsler

pomdp

Infrastructure for Partially Observable Markov Decision
Processes (POMDP)

Hossein Kamalzadeh

transition_matrix function

A <a rd-options='' href='POMDP'>POMDP</a> object.

Episode used for time-dependent POMDPs (<a rd-options='' href='POMDP'>POMDP</a>).

transition_matrix: Extract the Transition, Observation or Reward Information from a POMDP

Description

Usage

Arguments

Value

Details

See Also

Examples