TRUH test statistic for nonparametric two-sample testing under heterogeneity.
Usage
truh(V, U, B, fc = 1, ncores = 2, seed = 1)
Arguments
V
an \(m\times d\) matrix where \(m\) represents the sample size and
\(d\) is the dimension of each sample.
U
an \(n\times d\) matrix where \(n\) represents the sample size and
\(d\) is the dimension of each sample with \(m\ll n\).
B
number of bootstrap samples.
fc
fold change constant. The default value is 1. See equation (2.8) of the referenced paper for more details.
ncores
the number of computing cores available. The default value is 2.
seed
random seed for replicability. The default value is 1.
Value
teststat - TRUH test statistic.
k.hat - number of clusters detected in the uninfected sample.
pval - the maximum p-value across the detected clusters.
pval_all - p-value for each cluster.
dist.null_all - the approximate bootstrap-based null distribution.
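The relationship between pval and pval_all described above can be illustrated with a minimal sketch. The p-values below are made up for illustration and are not output from truh; the significance level alpha and the comparison against it are a conventional choice, not something this page prescribes.

```r
# Hypothetical per-cluster p-values standing in for pval_all
# (illustrative numbers only, not produced by truh)
pval_all <- c(0.012, 0.340, 0.048)

# pval is documented as the maximum p-value across the detected clusters
pval <- max(pval_all)

# Assumed conventional decision rule at level alpha
alpha <- 0.05
reject <- pval < alpha
```

Because the maximum is reported, a small pval requires every detected cluster to have a small p-value.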
References
Banerjee, Trambak, Bhaswar B. Bhattacharya, and Gourab Mukherjee.
"A nearest-neighbor based nonparametric test for viral remodeling in
heterogeneous single-cell proteomic data."
The Annals of Applied Statistics 14, no. 4 (2020): 1777-1805.
# NOT RUN {
library(truh)
n = 500
m = 10
d = 3
set.seed(1)
V = matrix(rnorm(m*d),nrow=m,ncol=d)
set.seed(2)
U = matrix(rnorm(n*d),nrow=n,ncol=d)
out = truh(V,U,100)
# }
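As a follow-up to the example above, the sketch below repeats the call with every argument named and then reads the documented components from the result. It assumes the truh package is installed and that the returned object is a list whose components carry the names given under Value; that access pattern is an assumption, not something this page states.

```r
library(truh)

# Same simulated data as in the example above (m = 10, n = 500, d = 3)
set.seed(1)
V <- matrix(rnorm(10 * 3), nrow = 10, ncol = 3)
set.seed(2)
U <- matrix(rnorm(500 * 3), nrow = 500, ncol = 3)

# All arguments named; fc, ncores and seed are left at their documented defaults
out <- truh(V, U, B = 100, fc = 1, ncores = 2, seed = 1)

# Assumption: components are accessible with `$` under the names listed in Value
out$teststat   # TRUH test statistic
out$k.hat      # number of clusters detected
out$pval       # maximum p-value across the detected clusters
out$pval_all   # p-value for each cluster
```

Naming the arguments makes it explicit that the third positional argument in the original example is B, the number of bootstrap samples.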