blm_star_gibbs_bnp

Compute MCMC samples from the posterior and predictive
distributions of a STAR linear regression model with a g-prior
and Bayesian nonparametric (BNP) transformation.

internal

For Bayesian and classical inference and prediction with count-valued data,
Simultaneous Transformation and Rounding (STAR) Models provide a flexible, interpretable,
and easy-to-use approach. STAR models the observed count data using a rounded
continuous data model and incorporates a transformation for greater flexibility.
Implicitly, STAR formalizes the commonly applied yet incoherent procedure of
(i) transforming count-valued data and subsequently
(ii) modeling the transformed data using Gaussian models.
STAR is well-defined for count-valued data, which is reflected in predictive accuracy,
and is designed to account for zero-inflation, bounded or censored data, and over- or underdispersion.
Importantly, STAR is easy to combine with existing MCMC or point estimation
methods for continuous data, which allows seamless adaptation of continuous data
models (such as linear regressions, additive models, BART, random forests,
and gradient boosting machines) for count-valued data. The package also includes several
methods for modeling count time series data, namely via warped Dynamic Linear Models.
For more details and background on these methodologies, see the works of
Kowal and Canale (2020) <doi:10.1214/20-EJS1707>,
Kowal and Wu (2022) <doi:10.1111/biom.13617>,
King and Kowal (2023) <doi:10.1214/23-BA1394>, and
Kowal and Wu (2023) <doi:10.48550/arXiv.2110.12316>.
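
To make the "rounded continuous data model with a transformation" concrete, the following base R sketch simulates counts from a STAR-type linear regression. It is only an illustration under stated assumptions: the transformation is taken to be the log, the rounding operator is taken to be the floor function, and the names `g_inv`, `round_floor`, `beta`, and `sigma` are hypothetical and not part of countSTAR.

```r
set.seed(1)

n <- 200
X <- cbind(1, rnorm(n))   # n x p matrix of predictors, including an intercept
beta <- c(1, 0.5)         # hypothetical regression coefficients
sigma <- 0.5              # hypothetical latent error standard deviation

# Gaussian linear model for the latent data on the transformed scale
z_star <- as.vector(X %*% beta) + sigma * rnorm(n)

# Assumed transformation g = log, so its inverse maps back to the data scale
g_inv <- function(z) exp(z)

# Assumed rounding operator: floor, with anything below zero mapped to zero
round_floor <- function(y_star) pmax(floor(y_star), 0)

# Observed counts: round the back-transformed latent data
y <- round_floor(g_inv(z_star))

table(y == 0)   # latent values below g(1) = 0 produce zero counts
```

In this sketch the zeros arise from the rounding step alone (any latent value with `exp(z_star) < 1`), which is one way a rounded continuous model can accommodate zero-inflation without a separate model component.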

Brian King

countSTAR

Flexible Modeling of Count Data

Dan Kowal

blm_star_gibbs_bnp function

<dl><dt>y</dt>
<dd><code>n x 1</code> vector of observed counts</dd>
<dt>X</dt>
<dd><code>n x p</code> matrix of predictors (including an intercept)</dd>
<dt>X_test</dt>
<dd><code>n_test x p</code> matrix of predictors for test data
(including an intercept); default is the observed covariates <code>X</code></dd>
<dt>y_max</dt>
<dd>a fixed and known upper bound for all observations; default is <code>Inf</code></dd>
<dt>psi</dt>
<dd>prior variance (g-prior); default is <code>n</code></dd>
<dt>alpha</dt>
<dd>prior precision for the Dirichlet Process prior; default is one</dd>
<dt>P0</dt>
<dd>function to evaluate the base measure PMF supported on <code>{0,...,y_max}</code>;
see below for default values when unspecified (<code>NULL</code>)</dd>
<dt>pilot_run</dt>
<dd>logical; if <code>TRUE</code>, use a short pilot run to approximate
the marginal CDF of the latent <code>z</code>; otherwise, use a Laplace approximation</dd>
<dt>nsave</dt>
<dd>number of MCMC iterations to save</dd>
<dt>nburn</dt>
<dd>number of MCMC iterations to discard</dd>
<dt>nskip</dt>
<dd>number of MCMC iterations to skip between saving iterations,
i.e., save every (nskip + 1)th draw</dd></dl>
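
The interplay of `nsave`, `nburn`, and `nskip` can be sketched in base R as follows. This is an illustration of a common thinning scheme, not the package's internal code: the total run length `nburn + (nskip + 1) * nsave` and the choice to keep the last draw of each block of `nskip + 1` iterations are assumptions, and `n_total`, `post_burn`, and `keep` are hypothetical names.

```r
nsave <- 5    # number of draws to save
nburn <- 10   # number of initial draws to discard
nskip <- 2    # number of draws to skip between saved draws

# Assumed total number of iterations needed to end up with nsave draws
n_total <- nburn + (nskip + 1) * nsave

# Iterations remaining after the burn-in is discarded
iters <- seq_len(n_total)
post_burn <- iters[iters > nburn]

# Keep every (nskip + 1)th post-burn-in iteration
keep <- post_burn[seq_along(post_burn) %% (nskip + 1) == 0]

keep
# [1] 13 16 19 22 25
```

With these settings the sampler would run 25 iterations and retain 5 of them, so larger values of `nskip` reduce autocorrelation among the saved draws at the cost of a proportionally longer run.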

Arguments

Gibbs sampler for STAR linear regression with BNP transformation — blm_star_gibbs_bnp


