blm_star_exact

Compute direct Monte Carlo samples from the posterior and predictive
distributions of a STAR linear regression model with a g-prior.

internal

For Bayesian and classical inference and prediction with count-valued data,
Simultaneous Transformation and Rounding (STAR) models provide a flexible, interpretable,
and easy-to-use approach. STAR models the observed counts by rounding a
continuous latent variable and applies a transformation to that variable for greater flexibility.
In doing so, STAR formalizes the commonly applied yet incoherent procedure of
(i) transforming count-valued data and subsequently
(ii) modeling the transformed data using Gaussian models.
STAR is well-defined for count-valued data, which is reflected in its predictive accuracy,
and it is designed to account for zero-inflation, bounded or censored data, and over- or underdispersion.
STAR is also easy to combine with existing MCMC or point estimation
methods for continuous data, so continuous data
models (such as linear regressions, additive models, BART, random forests,
and gradient boosting machines) can be adapted to count-valued data. The package also includes several
methods for modeling count time series data, namely via warped Dynamic Linear Models.
For more details and background on these methodologies, see the works of
Kowal and Canale (2020) <doi:10.1214/20-EJS1707>,
Kowal and Wu (2022) <doi:10.1111/biom.13617>,
King and Kowal (2023) <doi:10.1214/23-BA1394>, and
Kowal and Wu (2023) <doi:10.48550/arXiv.2110.12316>.
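
The "rounded continuous data model" described above can be illustrated with a small simulation. This is a minimal sketch in plain Python and not the package's R implementation: the helper names `star_round` and `simulate_star` are hypothetical, and the rounding rule (floor for positive latent values, zero otherwise, capped at `y_max`) is an assumption based on the usual presentation of STAR. A Gaussian linear model generates a latent value on the transformed scale, the inverse transformation maps it back, and rounding produces the count.

```python
import math
import random


def star_round(y_star, y_max=math.inf):
    """Round a continuous latent value to a count (assumed STAR rounding rule)."""
    if y_star <= 0:
        return 0  # all non-positive latent values map to a zero count
    return int(min(math.floor(y_star), y_max))  # floor, capped at the upper bound


def simulate_star(X, beta, sigma, g_inv, y_max=math.inf, seed=0):
    """Draw counts from a STAR linear model: z ~ N(x'beta, sigma^2), y = round(g_inv(z))."""
    rng = random.Random(seed)
    counts = []
    for row in X:
        mean = sum(x * b for x, b in zip(row, beta))  # linear predictor x'beta
        z = rng.gauss(mean, sigma)                    # latent Gaussian on the transformed scale
        counts.append(star_round(g_inv(z), y_max))    # back-transform, then round
    return counts


# Intercept plus one predictor; log transformation, so the inverse is exp.
X = [[1.0, i / 10] for i in range(20)]
y = simulate_star(X, beta=[0.5, 1.0], sigma=0.3, g_inv=math.exp)
print(y)
```

Because the rounding step maps every non-positive latent value to zero, this construction can place extra mass at zero without a separate zero-inflation component.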

Brian King

countSTAR

Flexible Modeling of Count Data

Dan Kowal

blm_star_exact function

<dl><dt>y</dt>
<dd><code>n x 1</code> vector of observed counts</dd>
<dt>X</dt>
<dd><code>n x p</code> matrix of predictors</dd>
<dt>X_test</dt>
<dd><code>n_test x p</code> matrix of predictors for test data</dd>
<dt>transformation</dt>
<dd>transformation to use for the latent data; must be one of<ul>
<li>"identity" (identity transformation)</li>
<li>"log" (log transformation)</li>
<li>"sqrt" (square root transformation)</li>
<li>"np" (nonparametric transformation estimated from empirical CDF)</li>
<li>"pois" (transformation for moment-matched marginal Poisson CDF)</li>
<li>"neg-bin" (transformation for moment-matched marginal Negative Binomial CDF)</li>
</ul></dd>
<dt>y_max</dt>
<dd>a fixed and known upper bound for all observations; default is <code>Inf</code></dd>
<dt>psi</dt>
<dd>prior variance (g-prior)</dd>
<dt>nsave</dt>
<dd>number of Monte Carlo simulations</dd>
<dt>compute_marg</dt>
<dd>logical; if <code>TRUE</code>, compute and return the marginal likelihood</dd></dl>
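
To make the fixed `transformation` options concrete, the following sketch pairs each with its inverse. It is illustrative Python only: the lookup `get_transformation` is hypothetical, and the package may use shifted or scaled versions of these functions (for example a Box-Cox form), so the plain forms below are an assumption. The `"np"`, `"pois"`, and `"neg-bin"` options are estimated from the data and are not reproduced here.

```python
import math

# Each entry maps an option name to (g, g_inv); plain textbook forms, assumed for illustration.
TRANSFORMS = {
    "identity": (lambda t: t, lambda z: z),
    "log": (math.log, math.exp),
    # The inverse of sqrt is only meaningful for z >= 0, so negative z is clipped to 0.
    "sqrt": (math.sqrt, lambda z: max(z, 0.0) ** 2),
}


def get_transformation(name):
    """Return the (g, g_inv) pair for a fixed transformation name."""
    try:
        return TRANSFORMS[name]
    except KeyError:
        raise ValueError(f"unknown transformation: {name!r}") from None


# Round-trip check: applying g and then g_inv recovers the input.
for name in TRANSFORMS:
    g, g_inv = get_transformation(name)
    print(name, g_inv(g(4.0)))
```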


blm_star_exact: Monte Carlo sampler for STAR linear regression with a g-prior
