geomc: Markov chain Monte Carlo for discrete and continuous distributions using geometric MH algorithms.

Description

geomc produces Markov chain samples from a user-defined target, which may be either a probability density function (pdf) or a probability mass function (pmf). The target is provided as an R function returning the log of its unnormalized pdf or pmf.

Usage

geomc(
  logp,
  initial,
  n.iter,
  eps = 0.5,
  ind = FALSE,
  gaus = TRUE,
  imp = list(enabled = FALSE, n.samp = 100, samp.base = TRUE),
  a = 1,
  show.progress = TRUE,
  ...
)

Value

The function returns a list with the following components:

samples: A matrix containing the MCMC samples. Each row is one sample.
acceptance.rate: The Metropolis-Hastings acceptance rate.
log.p: A vector with the logarithm of the (un-normalized) target density for each MCMC sample.
gaus: The value of the logical gaus.
ind: The value of the logical ind.
model.case: Describes whether default settings are used for both functions $f$ and $g$, for only one of them, or for neither.
var.base: The default variance used for the random-walk base density if not provided by the user.
mean.ap.tar: The default mean vector used for the approximate target density if not provided by the user.
var.ap.tar: The default covariance matrix used for the approximate target density if not provided by the user.

Arguments

logp

Either a function that evaluates the logarithm of the un-normalized target density (pdf or pmf) to sample from, or a list containing at least one element named log.target. The list may optionally include any of the following elements specifying additional functions for the base and approximate target densities: mean.base, var.base, dens.base, samp.base, mean.ap.tar, var.ap.tar, dens.ap.tar, samp.ap.tar, and bhat.coef. See below for details of these functions. Any optional elements not provided are treated as missing, and default behavior is applied.

log.target A function that evaluates the logarithm of the un-normalized target density (pdf or pmf). Its first argument must be the current state vector $x$, and it must return a numeric scalar. If the target density requires additional parameters, the user-supplied log.target must be written to accept them through ...
mean.base is the mean of the base density $f$, needs to be written as a function of the current state $x$.
var.base is the covariance matrix of the base density $f$, needs to be written as a function of the current state $x$.
dens.base is the density function of the base density $f$, needs to be written as a function $(y,x)$ (in this order) of the proposed state $y$ and the current state $x$, although it may not depend on $x$.
samp.base is the function to draw from the base density $f$, needs to be written as a function of the current state $x$.
mean.ap.tar is the vector of means of the densities $g_i(y|x), i=1,\dots,k$. It needs to be written as a function of the current state $x$. It must have the same dimension as $k$ times the length of initial.
var.ap.tar is the matrix of covariance matrices of the densities $g_i(y|x), i=1,\dots,k$ formed by column concatenation. It needs to be written as a function of the current state $x$. It must have the same dimension as the length of initial by $k$ times the length of initial.
dens.ap.tar is the vector of densities $g_i(y|x), i=1,\dots,k$. It needs to be written as a function $(y,x)$ (in this order) of the proposed state $y$ and the current state $x$, although it may not depend on $x$.
samp.ap.tar is the function to draw from the densities $g_i(y|x), i=1,\dots,k$. It needs to be written as a function of (current state $x$, the indicator of component $kk$). It must return a vector of the length of that of the initial.
bhat.coef is the vector of Bhattacharyya coefficients $\langle \sqrt{f(\cdot|x)}, \sqrt{g_i(\cdot|x)} \rangle, i=1,\dots,k$. It needs to be written as a function of the current state $x$. When gaus= TRUE, the bhat.coef argument is ignored and the built-in function is used.

initial

is the initial state.

n.iter

is the no. of samples needed.

eps

is the value for epsilon perturbation. Default is 0.5.

ind

is False if either the base density, $f$ or the approximate target density, $g$ depends on the current state $x$. Default is False.

gaus

is True if both $f$ and $g$ are normal distributions. Default is True.

imp

A list of three elements controlling optional importance sampling. This list has components:

enabled Logical. If FALSE (default), numerical integration is used to compute the Bhattacharyya coefficient $\langle \sqrt{f}, \sqrt{g_i} \rangle$. If TRUE, importance sampling is used instead.
n.samp A positive integer giving the number of Monte Carlo samples used when enabled = TRUE.
samp.base Logical. If TRUE, the samples in the importance sampler are drawn from the base density $f$; otherwise they are drawn from the approximate target density $g$. Default is FALSE.

When gaus = TRUE, the imp argument is ignored. Also, when bhat.coef is provided, the imp argument is ignored.

a

is the probability vector for the mixture proposal density. Default is the uniform distribution.

show.progress

Logical. Whether to show progress during sampling. Default: TRUE.

...

additional arguments passed to log.target

Author

Vivekananda Roy vroy@iastate.edu

Details

The function geomc implements the geometric approach of Roy (2024) to move an initial (possibly uninformed) base density $f$ toward one or more approximate target densities $g_i, i=1,\dots,k$, thereby constructing efficient proposal distributions for MCMC. The details of the method can be found in Roy (2024).

The base density $f$ can be user-specified through its mean, covariance matrix, density function, and sampling function. One or more approximate target densities $g_i, i=1,\dots,k$ can also be supplied. Just like the base density, each approximate target may be specified through its mean, covariance, density, and sampling functions.

A Gaussian ($f$ or $g_i$) density must be specified in terms of its mean and covariance; optional density and sampling functions may also be supplied.

Non-Gaussian densities, including discrete pmfs for either the base or approximate target, are specified solely via their density and sampling functions. If for either the base or the approximate target, the user specifies both a density function and a sampling function but omits either the mean function or the covariance function, then gaus = FALSE is automatically enforced.

If either or both of the mean and variance arguments and any of the density and sampling functions is omitted for the base density, geomc automatically constructs a base density corresponding to a random-walk proposal with an appropriate scale. If either or both of the mean and variance arguments and any of the density and sampling functions is missing for the approximate target density, then geomc automatically constructs a diffuse multivariate normal distribution as the approximate target.

If the target distribution is a pmf (i.e., a discrete distribution) then the user must provide the base pmf $f$ and one or more $g_i$'s. The package vignette vignette(package="geommc") provides several useful choices for $f$ and $g_i$. In addition, the user must set gaus=FALSE and either supply bhat.coef or set imp$enabled=TRUE (neither of which is the default for gaus or imp). This ensures that the algorithm uses either the user defined bhat.coef function or the importance sampling, rather than numerical integration or closed-form Gaussian expressions, for computing the Bhattacharyya coefficient $\langle \sqrt{f}, \sqrt{g_i} \rangle$ for discrete distributions.

For a discussion and several illustrative examples of the geomc function, see the package vignette vignette(package="geommc").

References

Roy, V.(2024) A geometric approach to informative MCMC sampling https://arxiv.org/abs/2406.09010

Examples

Run this code

#target is multivariate normal, sampling using geomc with default choices
log_target_mvnorm <- function(x, target.mean, target.Sigma) {d  <- length(x)
xc <- x - target.mean; Q  <- solve(target.Sigma); -0.5 * drop(t(xc) %*% Q %*% xc)}
result <- geomc(logp=log_target_mvnorm,initial= c(0, 0),n.iter=500, target.mean=c(1, -2),
               target.Sigma=matrix(c(1.5, 0.7,0.7, 2.0), 2, 2))
               # additional arguments passed via ...
result$samples # the MCMC samples.
result$acceptance.rate # the acceptance rate.
result$log.p # the value of logp at the MCMC samples.
#Additional returned values are
result$var.base; result$mean.ap.tar; result$var.ap.tar; result$model.case; result$gaus; result$ind
#target is the posterior of (\eqn{\mu}, \eqn{\sigma^2}) with iid data from 
#N(\eqn{\mu}, \eqn{\sigma^2}) and N(mu0, \eqn{tau0^2}) prior on \eqn{\mu} 
#and an inverse–gamma (alpha0, beta0) prior on \eqn{\sigma^2}
log_target <- function(par, x, mu0, tau0, alpha0, beta0) {
mu  <- par[1];sigma2 <- par[2]
if (sigma2 <= 0) return(-Inf)
n  <- length(x); SSE <- sum((x - mu)^2)
val <- -(n/2) * log(sigma2) - SSE / (2 * sigma2) - (mu - mu0)^2 / (2 * tau0^2)
-(alpha0 + 1) * log(sigma2) -beta0 / sigma2
return(val)}
# sampling using geomc with default choices
result=geomc(logp=log_target,initial=c(0,1),n.iter=1000,x=1+rnorm(100),mu0=0,          
tau0=1,alpha0=2.01,beta0=1.01)
colMeans(result$samples)
#target is mixture of univariate normals, sampling using geomc with default choices
set.seed(3);result<-geomc(logp=list(log.target=function(y) 
log(0.5*dnorm(y)+0.5*dnorm(y,mean=10,sd=1.4))),
initial=0,n.iter=1000) 
hist(result$samples)
#target is mixture of univariate normals, sampling using geomc with a random walk base density,
# and an informed choice for dens.ap.tar
result<-geomc(logp=list(log.target=function(y) log(0.5*dnorm(y)+0.5*dnorm(y,mean=10,sd=1.4)),
mean.base = function(x) x,
var.base= function(x) 4, dens.base=function(y,x) dnorm(y, mean=x,sd=2),
samp.base=function(x) x+2*rnorm(1),
mean.ap.tar=function(x) c(0,10),var.ap.tar=function(x) c(1,1.4^2),
dens.ap.tar=function(y,x) c(dnorm(y),dnorm(y,mean=10,sd=1.4)),
samp.ap.tar=function(x,kk=1){if(kk==1){return(rnorm(1))} else{return(10+1.4*rnorm(1))}}),
initial=0,n.iter=500)
hist(result$samples)
#target is mixture of univariate normals, sampling using geomc with a random walk base density, 
# and another informed choice for dens.ap.tar
samp.ap.tar=function(x,kk=1){s.g=sample.int(2,1,prob=c(.5,.5))
if(s.g==1){return(rnorm(1))
}else{return(10+1.4*rnorm(1))}}
result<-geomc(logp=list(log.target=function(y) log(0.5*dnorm(y)+0.5*dnorm(y,mean=10,sd=1.4)),
dens.base=function(y,x) dnorm(y, mean=x,sd=2),samp.base=function(x) x+2*rnorm(1),
dens.ap.tar=function(y,x) 0.5*dnorm(y)+0.5*dnorm(y,mean=10,sd=1.4), samp.ap.tar=samp.ap.tar),
initial=0,n.iter=500,gaus=FALSE,imp=list(enabled=TRUE,n.samp=100,samp.base=TRUE))
hist(result$samples)
#target is mixture of bivariate normals, sampling using geomc with random walk base density,
# and an informed choice for dens.ap.tar
log_target_mvnorm_mix <- function(x, mean1, Sigma1, mean2, Sigma2) {
return(log(0.5*exp(geommc:::ldens_mvnorm(x, mean1,Sigma1))+
0.5*exp(geommc:::ldens_mvnorm(x,mean2,Sigma2))))}
result <- geomc(logp=list(log.target=log_target_mvnorm_mix, mean.base = function(x) x, 
var.base= function(x) 2*diag(2), mean.ap.tar=function(x) c(0,0,10,10),
var.ap.tar=function(x) cbind(diag(2),2*diag(2))),initial= c(5, 5),n.iter=500, mean1=c(0, 0), 
Sigma1=diag(2), mean2=c(10, 10), Sigma2=2*diag(2))
colMeans(result$samples)
#While the geomc with random walk base successfully moves back and forth between the two modes,
# the random walk Metropolis is trapped in a mode, as run below 
result<-geommc:::rwm(log_target= function(x) log_target_mvnorm_mix(x,c(0, 0), diag(2), 
c(10, 10), 2*diag(2)), initial= c(5, 5), n_iter = 500, sig = 2,return_sample = TRUE)
colMeans(result$samples)
 #target is binomial, sampling using geomc
size=5
result <- geomc(logp=list(log.target = function(y) dbinom(y, size, 0.3, log = TRUE),
dens.base=function(y,x) 1/(size+1), samp.base= function(x) sample(seq(0,size,1),1),
dens.ap.tar=function(y,x) dbinom(y, size, 0.7),samp.ap.tar=function(x,kk=1) rbinom(1, size, 0.7)),
initial=0,n.iter=500,ind=TRUE,gaus=FALSE,imp=list(enabled=TRUE,n.samp=1000,samp.base=TRUE))
 table(result$samples)

Run the code above in your browser using DataLab