blm_star_exact_bnp

Compute direct Monte Carlo samples from the posterior and predictive
distributions of a STAR linear regression model with a g-prior
and Bayesian nonparametric (BNP) transformation.

internal

For Bayesian and classical inference and prediction with count-valued data,
Simultaneous Transformation and Rounding (STAR) Models provide a flexible, interpretable,
and easy-to-use approach. STAR models the observed count data using a rounded
continuous data model and incorporates a transformation for greater flexibility.
Implicitly, STAR formalizes the commonly applied yet incoherent procedure of
(i) transforming count-valued data and subsequently
(ii) modeling the transformed data using Gaussian models.
STAR is well-defined for count-valued data, which is reflected in predictive accuracy,
and is designed to account for zero-inflation, bounded or censored data, and over- or underdispersion.
Importantly, STAR is easy to combine with existing MCMC or point estimation
methods for continuous data, which allows seamless adaptation of continuous data
models (such as linear regressions, additive models, BART, random forests,
and gradient boosting machines) for count-valued data. The package also includes several
methods for modeling count time series data, namely via warped Dynamic Linear Models.
For more details and background on these methodologies, see the works of
Kowal and Canale (2020) <doi:10.1214/20-EJS1707>,
Kowal and Wu (2022) <doi:10.1111/biom.13617>,
King and Kowal (2023) <doi:10.1214/23-BA1394>, and
Kowal and Wu (2023) <doi:10.48550/arXiv.2110.12316>.
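The "rounded continuous data model" described above can be made concrete with a small simulation. The following is a minimal sketch in base R, not the package's implementation. It assumes a log transformation, a floor rounding operator, and hypothetical values for the coefficients, the error standard deviation, and the upper bound.

```r
# Illustrative sketch only: simulate counts from a STAR-type model.
# Assumptions (not from the package): log transformation, floor rounding,
# and made-up values for beta, sigma, and y_max.
set.seed(1)

n <- 200
X <- cbind(1, rnorm(n))        # n x p design matrix with an intercept column
beta <- c(1, 0.5)              # hypothetical regression coefficients
sigma <- 0.5                   # hypothetical error standard deviation

# (1) Gaussian linear model on the transformed (latent) scale
z <- as.numeric(X %*% beta) + sigma * rnorm(n)

# (2) Invert the assumed log transformation to get latent continuous data
y_star <- exp(z)

# (3) Round down to the integers and cap at the known upper bound
y_max <- 50
y <- pmin(floor(y_star), y_max)

table(y)
```

The resulting `y` is integer-valued, nonnegative, and bounded by `y_max`, even though the regression itself is an ordinary Gaussian linear model on the latent scale.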

Brian King

countSTAR

Flexible Modeling of Count Data

Dan Kowal

blm_star_exact_bnp function

<dl><dt>y</dt>
<dd><code>n x 1</code> vector of observed counts</dd>
<dt>X</dt>
<dd><code>n x p</code> matrix of predictors (including an intercept)</dd>
<dt>X_test</dt>
<dd><code>n_test x p</code> matrix of predictors for test data
(including an intercept); default is the observed covariates <code>X</code></dd>
<dt>y_max</dt>
<dd>a fixed and known upper bound for all observations; default is <code>Inf</code></dd>
<dt>psi</dt>
<dd>prior variance (g-prior); default is <code>n</code></dd>
<dt>alpha</dt>
<dd>prior precision for the Dirichlet Process prior; default is <code>1</code></dd>
<dt>P0</dt>
<dd>function to evaluate the base measure PMF supported on <code>{0,...,y_max}</code>;
see below for default values when unspecified (<code>NULL</code>)</dd>
<dt>pilot_run</dt>
<dd>logical; if <code>TRUE</code>, use a short pilot run to approximate
the marginal CDF of the latent <code>z</code>; otherwise, use a Laplace approximation</dd>
<dt>nsave</dt>
<dd>number of Monte Carlo iterations to save</dd></dl>
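As a rough guide to preparing these arguments, the sketch below builds `X` and `X_test` with an intercept column and defines a candidate base measure PMF on `{0,...,y_max}`. The simulated data, the truncated Poisson choice, and the name `P0_trunc_pois` are all hypothetical. It is also an assumption that `P0` is called with a vector of counts and returns their probabilities. The final call is left commented out because the function is marked internal and may need `:::` to access; the values passed for `pilot_run` and `nsave` are illustrative, not documented defaults.

```r
# Illustrative sketch only: preparing inputs for blm_star_exact_bnp.
# All data and the P0 definition below are hypothetical.
set.seed(42)

n <- 100
dat <- data.frame(x1 = rnorm(n), x2 = rnorm(n))
y <- rpois(n, lambda = exp(0.5 + 0.3 * dat$x1))   # simulated counts

# n x p design matrix; model.matrix() adds the intercept column
X <- model.matrix(~ x1 + x2, data = dat)

# n_test x p design matrix with the same columns as X
dat_test <- data.frame(x1 = c(-1, 0, 1), x2 = 0)
X_test <- model.matrix(~ x1 + x2, data = dat_test)

# A fixed upper bound, taken here as known in advance
y_max <- 100
stopifnot(all(y <= y_max))

# Hypothetical base measure: Poisson PMF truncated to {0, ..., y_max}
# (assumes P0 receives a vector of counts and returns probabilities)
P0_trunc_pois <- function(k, lambda = mean(y)) {
  ifelse(k >= 0 & k <= y_max,
         dpois(k, lambda) / ppois(y_max, lambda),
         0)
}

# The PMF should sum to one over its support
sum(P0_trunc_pois(0:y_max))

# Possible call, using only the arguments documented above:
# fit <- countSTAR:::blm_star_exact_bnp(
#   y = y, X = X, X_test = X_test, y_max = y_max,
#   psi = n, alpha = 1, P0 = P0_trunc_pois,
#   pilot_run = FALSE, nsave = 1000
# )
```

Building both matrices from the same formula keeps the columns of `X_test` aligned with those of `X`, which the `n_test x p` requirement implies.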

Arguments

Monte Carlo sampler for STAR linear regression with BNP transformation — blm_star_exact_bnp


