Finds the depth-L tree that maximizes the sum of rewards by exhaustive search. If the optimal
action is the same in both the left and right leaves of a node, the node is pruned.
Usage
policy_tree(X, Gamma, depth = 2, split.step = 1)
Arguments
X
The covariates used. Dimension \(N \times p\) where \(p\) is the number of features.
Gamma
The rewards for each action. Dimension \(N \times d\) where \(d\) is the number of actions.
depth
The depth of the fitted tree. Default is 2.
split.step
An optional approximation parameter (a positive integer): the number of possible splits
to consider when performing tree search. split.step = 1 (default) considers every possible split; split.step = 10
considers splitting at every 10th distinct value and can yield a substantial speedup for densely packed
continuous data (see the sketch below).
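A minimal sketch of how split.step trades search exactness for speed; the data sizes below are illustrative only, and the snippet assumes policy_tree is loaded from the policytree package.
library(policytree)
n <- 2000
p <- 5
d <- 3
X <- matrix(rnorm(n * p), n, p)       # dense continuous covariates
Gamma <- matrix(rnorm(n * d), n, d)   # one reward column per action
# Exact search: every distinct covariate value is a split candidate.
exact.tree <- policy_tree(X, Gamma, depth = 2, split.step = 1)
# Approximate search: only every 10th distinct value is considered,
# which is typically much faster on densely packed continuous data.
approx.tree <- policy_tree(X, Gamma, depth = 2, split.step = 10)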
Value
A policy_tree object.
References
Zhou, Zhengyuan, Susan Athey, and Stefan Wager. "Offline multi-action policy learning:
Generalization and optimization." arXiv preprint arXiv:1810.04778 (2018).
Examples
n <- 50
p <- 10
d <- 3
# Draw random covariates and rewards for illustration.
features <- matrix(rnorm(n * p), n, p)
rewards <- matrix(rnorm(n * d), n, d)
# Fit a depth-2 policy tree on the simulated data.
tree <- policy_tree(features, rewards, depth = 2)
tree
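A fitted tree can also be used to assign actions to samples. This sketch assumes the predict method provided for policy_tree objects, which returns the index of the assigned action for each row of the covariate matrix.
# Assign an action (1, ..., d) to each sample (assumes predict.policy_tree
# from the policytree package).
predicted.actions <- predict(tree, features)
head(predicted.actions)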