get_betas

Creates a list of matrices representing the arm-specific reward-generating parameters (betas)
used in contextual linear bandit simulations. Each matrix corresponds to one simulation
and contains normalized random coefficients.

Performs the Cram method, a general and efficient approach to simultaneous learning and evaluation using a generic machine learning algorithm. In a single pass of batched data, the proposed method repeatedly trains a machine learning algorithm and tests its empirical performance. Because it utilizes the entire sample for both learning and evaluation, cramming is significantly more data-efficient than sample-splitting. Unlike cross-validation, Cram evaluates the final learned model directly, providing sharper inference aligned with real-world deployment. The method naturally applies to both policy learning and contextual bandits, where decisions are based on individual features to maximize outcomes. The package includes cram_policy() for learning and evaluating individualized binary treatment rules, cram_ml() to train and assess the population-level performance of machine learning models, and cram_bandit() for on-policy evaluation of contextual bandit algorithms. For all three functions, the package provides estimates of the average outcome that would result if the model were deployed, along with standard errors and confidence intervals for these estimates. Details of the method are described in Jia, Imai, and Li (2024) <https://www.hbs.edu/ris/Publication%20Files/2403.07031v1_a83462e0-145b-4675-99d5-9754aa65d786.pdf> and Jia et al. (2025) <doi:10.48550/arXiv.2403.07031>.

Yanis Vandecasteele

cramR

Cram Method for Efficient Simultaneous Learning and Evaluation

get_betas function

<dl><dt>simulations</dt>
<dd>Integer. Number of simulations.</dd>
<dt>d</dt>
<dd>Integer. Number of features (context dimensions).</dd>
<dt>k</dt>
<dd>Integer. Number of arms.</dd></dl>

Arguments

Generate Reward Parameters for Simulated Linear Bandits — get_betas

<dl>

<dt>simulations</dt>
<dd>Integer. Number of simulations.</dd>


<dt>d</dt>
<dd>Integer. Number of features (context dimensions).</dd>


<dt>k</dt>
<dd>Integer. Number of arms.</dd>

</dl>

get_betas: Generate Reward Parameters for Simulated Linear Bandits

Description

Usage

Value

Arguments