Usage
hit(x, y, hierarchy, family = "gaussian", B = 50, p.samp1 = 0.5,
sel.method = c("AF", "CV"), act.freq = 0.5, nfolds = 10,
lambda.opt = c("lambda.1se", "lambda.min"), gamma = seq(0.05, 0.99,
length.out = 100), max.p.esti = 1, mc.cores = 1L, trace = FALSE, ...)Arguments
x
Design matrix of dimension n * p, without intercept.
Variables not part of the dendrogram are added to the HO-model, see Details
below.
y
Quantitative response variable dimension n.
hierarchy
Object of class as.hierarchy. Must include
all variables of x which should be tested. family
Family of response variable distribution. Ether y is
"gaussian" in which case y must be a vector or it is "binomial"
distibuded. In this case y should be either a factor with two levels,
or a two-column matrix of
B
Number of sample-splits.
p.samp1
Fraction of data used for the LASSO. The ANOVA uses
1 - p.samp1.
sel.method
Name of method for the selection of covariates via the
LASSO, either "AF" (default), the occurrence frequenze of covariates along
the lambda path or "CV" n-fold cross-validation.
act.freq
Frequency in which a covariate has to occur along the
lambda path to be selected in the active set. Ignored if sel.method
is "CV".
nfolds
Number of folds (default is 10). Ignored if sel.method
is "AF". See cv.glmnet for more details. lambda.opt
Criterion for optimum selection of cross validated lasso.
Either "lambda.1se" (default) or "lambda.min". Ignored if sel.method
is "AF". See cv.glmnet for more details. gamma
Vector of gamma-values.
max.p.esti
Maximum alpha level. All p-values above this value are set
to one. Small max.p.esti values reduce computing time.
mc.cores
Number of cores for parallelising. Theoretical maximum is
'B'. For details see mclapply. trace
If TRUE it prints current status of the program.